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Lecture I. Introduction to the Trace Formula 



The ultimate purpose of the trace formula is to better understand the structure 
of automorphic forms on a general reductive group G. In particular, one wants 
to understand the decomposition of the right regular representation Ro(g) of G(A) 
in the space of cusp forms Lq{G{F) \ G(h)). The original idea of the (Selberg) 
trace formula — roughly speaking — was to describe the trace of the integral operator 
Ro(f) — Jg(a) / (.9)Ro(.9)dg for / in C£°(c7(A)); this Selberg succeeded to do for certain 
special G. On the other hand, Arthur's trace formula is valid for arbitrary G, and 
asserts the equality of two different sums of distributions on G, namely: 

(o.i) E JJ(f) = E ■£(/)■ 

Here the left side of (0.1) is summed over certain equivalence classes in the group 
of rational points G(F); it is called the geometric side of the trace formula. On the 
right side of (0.1), the sum is over certain "cuspidal data" x = {.{M, o~)} of C7; this is 
the spectral side of the trace formula (one of whose terms is tri? (/))- 

Our purpose in these lectures is to describe and develop this formula in complete 
detail, but mostly in the context of GL(2). We also wish to describe some refinements 
of (0.1) necessary for applications to automorphic forms. The basic references are 
Arthur's original papers [Al]-[A10]. Earlier expositions of the GL(2) trace formula, 
such as [Ge] and [GJ], will be referred to occasionally; but as they mostly predate 
Arthur's general development of the trace formula, their point of view will largely be 
ignored, and their arguments not reproduced here. 

1. Some history. "The" trace formula was introduced by Selberg, in the context 
of a semi-simple Lie group G and discrete subgroup T, in his famous 1956 paper [Se]. 
In that paper Selberg first of all described a general formula in the case of compact 
quotient T \ G (see the next section below for a brief description). Secondly, he 
treated in detail certain special non-compact quotient cases such as SX 2 (z) \ «SX 2 (m); 
here Selberg analytically continued Eisenstein Series E(z, s) in order to handle the 
continuous spectrum of L 2 (T \ G), and derived a formula for the trace of R(f) in 
the space of cusp forms. In both the case of compact and non-compact quotient, 
Selberg was motivated by applications to geometry and number theory (lengths of 
geodesies in the Riemann surface in the case of compact quotient, traces of Hecke 
operators in case of non-compact quotient, etc.). But despite the obvious desirability 
of generalizing these results in the case of non-compact quotient, the obstacles — such 
as analytically continuing the Eisenstein series in higher rank — were formidable. A 
big breakthrough came in the 1960's when Langlands developed a theory of Eisen- 
stein series valid for any reductive group G, and used it to describe the continuous 
spectrum of L 2 (G(F) \ G(h)) (cf. [La 1]). As an outgrowth of this theory, Langlands 
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was led to his amazing conjectures about automorphic forms and their L-functions, 
i.e., the "Langlands program", with its cornerstone the principle of functoriality of 
automorphic forms. A typical result in this program is the following: 

Theorem 1 ([JL]). Let D be a division quaternion algebra over a number field F 
and G' = D* . Then to each automorphic cuspidal representation a of G' (i.e., an 
irreducible G'(A)-submodule of L 2 (Z(A)G'(F) \G'(A)) of dimension greater than one) 
there exists a corresponding automorphic cuspidal representation n = n(a) 0/ 67(A) = 
GIj2(a) (an irreducible G(A)-submodule of the space of cusp forms Lq(Z(A)G(F) \ 
G(A)) with the property that for each place v in F unramified for D (i.e. where 
G'{F V ) ~GU{F V )), a v ~Tr v . ' 

Interestingly enough, this correspondence (the well-known Jacquet-Langlands cor- 
respondence) can be proved using any one of the three basic tools of the theory of 
automorphic forms: the theory of automorphic L-functions, the theory of theta-series 
liftings, or the trace formula (for both G' and G = GL 2 , essentially as developed by 
Selberg). However, in this case at least, the trace formula approach gives the strongest 
payoff, namely a characterization of the image of the correspondence a — > ir(a): a 
cuspidal representation ir of G(A) is of the form n(a) for some a on a given G' D 
if and only if for each ramified prime v, n v is a square integrable (discrete series) 
representation of G v = GL^i^). 

Subsequently, a ("twisted") trace formula for GL2 was developed by Saito-Shintani 
and Langlands to prove another important instance of functoriality, namely base 
change (with spectacular applications to Artin's conjecture for Galois representations, 
and hence ultimately to Wiles' proof of Fermat ... ). Thus the need to generalize 
the trace formula to a general reductive group G was clear, and this is precisely what 
Arthur did in his papers between the early 1970's and late 1980's. 



2. The Case of Compact Quotient. In order to motivate Arthur's work, let us 
explain how the form (2) takes in the case of compact quotient. Thus G is a semi- 
simple group over (the number field) F, with G(F)\G(A) compact. In particular, G is 
anisotropic, without unipotent elements. With R(g) the right regular representation 
in L 2 (67(F) \ G(A)) and / in C%° (67(A)), a straightforward computation gives 



(R(f)<f>)(x) = f f{y)<t>{xy)dy = 



L 



f(x 1 y) ( f)(y)dy= / ( J2 f( x 1 iv)) ( t ) {y) d y^ 

G(A) JG(F)\G(A) 7gG(F) 

i.e., R(f) is an integral operator on in L 2 (67(F) \ 67(A)), with kernel 

K f(%,y)= E f( x ~ l iy)- 

jEG(F) 

Since G(F) \ 67(A) is compact, it is then a well known theorem (cf. [GGPS], Chapter 
I), that R(f) is of trace class, and that 



tri?m = / K f (x,x)dx, 
J> Jg(f)\g(k) n ' 
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the integral of the kernel over the "diagonal subgroup" G(F) \ G(A) A . In this case, 
the (Selberg) trace formula amounts to a calculation of traceR(f) using two different 
expressions for this kernel. On the one hand, we can write 

K f( x >y) = E K °( x >y) 

where D denotes the set of conjugacy classes in G(F), and 

K (x,y)= E f(x- 1 iv)= E Hx-^oSx) 

7eo={ 7 o} 5eG 70 (F)\G(F) 

where G l0 (F) is the centralizer of 70 in G(F). Then 

J (f) = / K f (x,x) dx = / V f(x~ 1 S~ 1 -f Sx) dx = 

Jg(f)\g(a) J G (F)\G(A) scg^gw 

f(x- 1 7 x)dx = meas(G lQ (F)\G 10 (A)) f((x~ l lo x)dx, 
Jg jo (f)\g(a) Jg jo (a)\g(a) 

and thus 

trfl(/) = E ■/-(/) = E^a S (G 7 (F) \ G 7 (A)) f /((aT^s) dx. 

odO { 7 } JG,,(A)\G(A) 

On the other hand, the compactness of G(F) \ G(A) also implies a discrete decom- 
position R(g) = J2n m TT 7T (9) where n runs through the irreducible "automorphic" 
representations of G(A), and m n < 00 is the multiplicity of n in R. Equivalently, 
L 2 (G(F) \ G(A)) =J2L n where L v is the invariant subspace realizing the m n copies 
of 7r, and R(f) = Rn(f) where each R n (f) is an integral operator in L v with kernel 



K*(x,y)= E (R(f)cf>)(x)cf>(y). 

<fi an ON basis forL^ 

Thus we also have 

tri2(/)= E 

xe{(G,^)} 

where 



Jy(f)— / K w (x,x)dx. 
x J> Jg(F)\G(A) v 7 



/G(F)\G(A) 

The power of the trace formula derives from the fact that this equality 

(2.1) E-W) = E 

namely 

Ym y I f(x- 1 1 x)dx = Y ( (Y(R(f)(f))(xU(x))dx 



{7} 



relates two completely different kinds of data, geometric and spectral (the latter of 
which we rarely know much about, but very much want to!). 
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3. Difficulties in the Non-Compact Case. Arthur's great contribution was to 
figure out how to overcome all the obstacles that present themselves in generalizing 
(2.1) from the compact to the non-compact quotient setting. Some of the principal 
problems in the non-compact setting are these: 

(a) Although R(f) is still an integral operator, with kernel 

7 

this kernel is no longer integrable over the diagonal; and 

(b) The regular representation R(g) no longer decomposes discretely, Eisenstein 
series are required to describe the continuous spectrum, and R(f) is of course no 
longer of trace class. Nevertheless, Arthur derives a formula (0.1) which still relates 
"geometric" and "spectral" distributions attached to G — and has powerful applica- 
tions to number theory (even if neither side of (0.1) represents a formula for tri?(/)!). 
For example, in this formula, 



Jj(f)= f Kj(x,x)dx, 

JZ(A)G(F)\G(A) 



>Z(A)G(F)\G(A) 

where Kj is a modification of the kernel 

K (x,x) = ^fix'^x) 

which can be integrated over the diagonal. To see why this modification is really 
needed, and what it accomplishes, let us look at the example of G — GL(2), and o 
the hyperbolic conjugacy class 

o = {5~ 1 («° l )5:5eM(F)\G(F)}, 
where M = {^° ^ ^ }. In this case, 

J (f) = / K (x,x)dx 

JZ(A)G(F)\G(A) 

(where Z is the center of scalar matrices). A formal computation then shows 
Uf) = I E f(x- 1 5- 1 ( a ° 1 )5x)dx 



L 
I 

Jm 



fix-H^Ax) dx 

Z(A)G(F)\G(A) 



if dy)f(x- 1 ( a °)x)dx, 

y Jz(A)M(F)\M(A) y ' JK \0lJ ' ' 



a0 )) 
Oil)' 



'M(A)\G(A) JZ(A)M(F)\M(A) 

i.e., 

J (f)=m(Z(A)M(F)\M(A))F f ([ 

with Fj(^Jj) the nice "orbital integral" Jm(a)\g(a) /( a; ~ 1 (o l) 3 ') ^ x °^ a com P ac tly 
supported function / in CZ°(Z(A)\G(A)). Thus J (f) is infinite for the simple reason 
that 

m(Z(A)M(F) \ M (A)) = m(F* \ A*) = oo. 
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To compensate for this, Arthur "modifies" K to Kj, using a truncation parameter 
T = (Ti, T 2 ) in the Lie algebra of M. As we shall see, this modification operator (in 
this case) "truncates" F* \ A* to a finite interval of length T 2 — Ti, but turns Ff into 
a "weighted" orbital integral! In general, for an arbitrary G, it will turn out that 
K will be integrable "over the diagonal" only when intersects no proper parabolic 
subgroup P of G. Of course for anisotropic G this always happens, but in general 
one really needs to modify K (as in the above example), prove that the modified 
kernels Kj are indeed integrable, and then evaluate them (as generalized weighted 
orbital integrals). 

What about the spectral side? Here it turns out that L 2 (Z(A)G(F) \ G(A)) = 
J2 x ex where each L x is a G(A)-invariant (not necessary discretely decomposable) 
submodule indexed by certain cuspidal data X = {(M, a)}, where M is a Levi sub- 
group of a parabolic P C G, and a is a cuspidal automorphic representation of M(A). 
Thus we have K(x,y) = J2 K x (x, y), the spectral decomposition of Kf(x,y) corre- 
sponding to (0.1). Here it also turns out that K x (x, y) will not be integrable over the 
diagonal, unless the cuspidal data (M, a) = (G, ir) (i.e. P = G). Thus we again need 
to deal with modified K x (which are integrable ... ), eventually arriving at the trace 
formula (0.1) by integrating the equality 

(3.1) £iC(^) = £^x>,y) 

over the diagonal. 

Remarks. (1) The expressions on either side of (3.1) no longer (generally) represent 
the kernel Kf(x, y) — ^2 f(x~ lr yy), and hence (0.1) is no longer a real trace formula at 
all. Nevertheless, what is true is the following. Let X(G) be the subset of cuspidal data 
X = (M, a) £ X with M = G. Then J2 x ex(G) Kx( x > v) * s ^ ne kernel of R(f) restricted 
to Lq(Z(a)G(F) \ G(A)), it is integrable over the diagonal, and (0.1) asserts that 

(3.2) trifc(/)= E = E " E J T X U), 

X GX(G) X\X(G) 

i.e., Arthur's trace formula may be viewed as a trace formula after all. 

(2) In most applications of the trace formula to automorphic forms we want to com- 
pare the formula for one group with the trace formula for another. In that case the 
form of (0.1) is no less useful than (3.2). 

4. Possible Applications. We have already mentioned the applications to quater- 
nion algebras and base change, both of which we plan to discuss (at the level of GL(2), 
not GL(n) - which is the subject matter of [AC], using the more difficult GL(n) trace 
formulas). We also plan to discuss (i) Jacquet's relative trace formula, where the idea 
is to integrate the (different expressions for the) kernel over interesting non-diagonal 
subgroups of G x G; and (ii) applications of the simple trace formula such as to 
the embedding of a (local) discrete series representation of GL 2 (i^) into a (global) 
cuspidal representation of GL 2 (A). In all these applications, one needs to further re- 
fine Arthur's first form of the trace formula. This is because (as already hinted) the 
"truncation" i.e., modification process used to make K or K x integrable introduces 
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various defects into the distributions Jj(f) and Jj(/). For example, they may no 
longer be invariant distributions (invariant under conjugation by G(A)), or factoriz- 
able as the product of local distributions over G(F V ). Thus one needs to replace and 
rearrange the sums of distributions appearing in (0.1) by still more complicated but 
ultimately more practical expressions. 
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Lecture II. Arthur's Modified Kernels I: The Geometric Terms 



Our purpose here is to develop the geometric side of Arthur's first form of the trace 
formula 

E JJ(f) = E 4(fl 

for the group G = GL{2). In particular, we must introduce the modified kernels 
kj(x, f), and prove they are integrable. 

N.B. The treatment of the trace formula in [Ge] and [GJ] largely ignored the 
integrability of these modified kernels. 

1. The Definition of K . In the case of compact quotient, the geometric side of 
the trace formula is summed over ordinary C7(F)-conjugacy classes o in G(F) (or 
in Z(F)\G(F), where Z is the center of G). For arbitrary G, it turns out to be 
best to make the following definition: 7! and 72 in Z(F)\G(F) are called equivalent, 
or " conjugate" , if their semi-simple components are G(F) conjugate in the usual 
sense. That is, 71 ~ 72 means, with the Jordan decompositions 7, = 7? 7" {i = 1,2), 
that 7J is C7(F)-conjugate to 7|. For each such equivalence class 0, and for / e 
C7(Z(A)\C7(A)), set 

and let D denote the collection of all such classes in Z(A)\G(A). 

Note that if 7 is elliptic, i.e. if 7 is not conjugate in G(F) to an element of any proper 
parabolic subgroup P(F), then it is automatically semi-simple. (Its eigenvalues do 
not lie in F, and hence are distinct.) Thus our new notion of conjugacy reduces to 
ordinary conjugacy. 

On the other hand, suppose 7 is C7(F)-conjugate to an element p in a proper 
parabolic subgroup P(F). For GL(2), this means P is the Borel subgroup B = 
{(H)}, so modulo Z(F), 

p = p s p u with p s =(^), and P u =(l\). 

Case (a): a = 1. In this case, the class containing 7 consists of two ordinary 
unipotent conjugacy classes in Z(F)\G(F): the trivial class {1}, and the ordinary 
conjugacy class of any non-trivial unipotent element Q^ ) (since (01) a l rea dy con_ 
jugates (J 7) to (l a *°) ). 

Case (b): a ^ 1. In this case, is an ordinary hyperbolic conjugacy class of the 
form 

o a = {S- 1 ( a ° 1 )5\5eM(F)\G(F)}. 
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Note that M(F) is just the centralizer G / a0 \ (F), and that also belongs to o a 

( §ince (o i) = o ly'i^diy 1 where y = - *) )• 

2. The Integrability of Elliptic K . As suggested in Lecture I, it is a general 
phenomenon that for an elliptic class o, the kernel K (x,y) is integrable over the 
diagonal subset Z(a)G(F)\G(a). We now prove this for G = GL(2), following the 
general arguments of [Al]. The crucial step is a lemma which first requires some 
preliminary notions to be recalled. 

According to the Iwasawa decomposition for G, we have 

(2.1) C7(A) = N(A) M (A) K, 

where N is the standard unipotent subgroup { Q i ) }' anc ^ ^ * s ^ ne standard maximal 

compact subgroup n K v of G(A). If we denote by n M the Lie algebra of M — { (q °) }> 
i.e. , 

a M = Hom(X(M),R) w K 2 , 
then we can define the familiar function 

H M :M(A)^a M by H M (l ° b ) = (log \a\ A , log |6| A ), 

and extend it to G(a) by way of the decomposition 2.1, i.e. if g = nak in G(A), then 

H M (g) = H M (a). 

Lemma 2.1 ( see [Al, p 362] ). Suppose Q is a subset of G(A) which is compact 
modulo Z(A). Then there exists a number d n > with the following property: if 
7 G G(F) is such that 

g~ l >yng e Cl 

for some n G N(A) and g G G(A) with ceH(g) > dn, then 7 G B(F). 

Assuming this Lemma (for the moment), let us quickly dispose of the integrability 
of K . For this, and for other purposes later on, we also need to recall the notion of 
a Siegel domain, together with an explicit integration formula for its volume. 

First we refine the Iwasawa decomposition 2.1 by writing, for each g in G(a), 

(2.2) g = znh t mk 
where z G Z(A), n G N(A), k G K, 

h t = (0 e-*)' 

with e* an idele equal to 1 at every finite place and the real number e* at each infinite 
place, and 

»•=(:;). 

with a G A 1 = {a G A x | \a\ A = 1}. The corresponding integration formula, for / on 
Z(A)\G(A), is 

(2.3) / f(g)dg= ((I r f(natk)e" 2t dtd x adndk. 

JZ(A)\G{A) J J J J-00 
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Now for each compact subset C\ of N(A), compact subset C2 of A 1 , and real number 
c (not necessarily positive), let & c denote the set of points g in G(A) of the form (2.2) 
with n G Ci, a G C2 and t > c/2. Such a set is called a Siegel set. Note that 

6 C C (geG(A) I H(g) > c}. 
Fact 1 (see [Go, Thm 9]). For any Ci, c 2 , 

6 C1 n 76 C2 7^ 
for only finitely many 7 in G(F) modulo B(F). 

Fact 2 (see [Go, p 16]). If & c is sufficiently large (i.e. C\ and C2 are sufficiently large, 
and c is sufficiently small), then 

(2.4) G(A) = G(F) • 6 C . 

In other words, S c is essentially a fundamental domain for G(F) acting on G(A); 
more precisely, Fact 2 says only that certain & c contain a fundamental domain. 
Henceforth, we always assume that 6 C is so chosen that 2.4 holds, and we refer to 
such a 6 C as a Siegel domain. 

Remark. From the above, it is clear that 

Z(A)G(F)\G(A) 

has finite volume. In fact, by 2.3 and 2.4, 

/ dg< I dg= [ [ [ ( r e- 2t dt) d x adndk, 

J Z(A)G(F)\G(A) JZ(A)\e c J J J \Jc J 

with the outer three integrals over compact domains. 
Proposition 2.2. If is elliptic, then 

x 1— > K (x, x) = f(x~ 1, jx) 

is absolutely integrable over Z(A)G(F)\G(A), and its integral is 

J (f)=m(Z(A)G(F)\G(A)) f f{x~ l 1X )dx. 

■/G 7 (A)\G(A) 

Proof. It suffices to prove the integrability assertion, since the ensuing formula for 
J (f) follows just as in the case of compact quotient. 

First note that for x lying in any fixed set fii, which is compact modulo Z(A), the 
sum defining K (x,x) is actually finite and hence K defines a nice, smooth function 
on Z(A)G(F)\G(A). Indeed, f(x^ 1 ^/x) ^ implies x~ 1 jx G Qf — support of /. 
Thus 7 itself must lie in a compact set modulo Z(A), namely fii^/fi^ 1 , and such a 
set can contain only finitely many 7 in Z(F)\G(F). 

On the other hand, if 7 is elliptic, then f(x~ lr jx) 7^ implies (by Lemma 2.1) that 
aH(g) < dn f . I.e., the support of 

K ci i(x,x)= f( x ~ l l x ) 

7 elliptic 
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lies in the "doubly truncated" Siegel set 

e c d = Z{k)C 1 {h t }C 2 K with c<2t<dn r 

Equivalently, the support of K e n is compact modulo Z(A), and the integrability is 
clear. □ 

Remark. The argument above actually proves that the full elliptic contribution is 
absolutely integrable, i.e. , that 



K e \\(x, x) \ dx < oo. 
Thus, by the Lebesgue dominated convergence theorem, it follows that 

J \K (x, x)\ dx < oo, 

o elliptic 

and hence that the sum 

E Uf) 

o elliptic 

converges absolutely. 

It remains now to prove Lemma 2.1. It might be tempting to do so for GL(2) 
by simply trying to multiply out some 2x2 matrices (as we did in [Ge, p 201]). 
However, this more quickly than not leads to an incomplete proof (as happened in 
fact in [Ge]), whereas the approach of [Al, p 362] works simply and generally, as we 
shall show below. 

Proof of Lemma 2.1. Let 

p : G -> G£{V) 

denote the adjoint representation of G on the space of trace zero 2x2 matrices over 
F; its highest weight vector is A = a, the unique simple root of G with respect to 
the torus M. Fix a basis {e ,ei,e 2 } of V such that 

p(a)e j = a A - ja ej for a G M (j = 0, 1, 2) 
p(n)e = e for n G N and 

p(w)e = e 2 
For £ in V(A), define 

neii = nn^L 

V 

where, for finite is, \\£i,\\ v = maxj{|£^} (where denote the coordinates of £ u 
with respect to the basis {e^}), and for infinite v, H^H^ is defined by the Hilbert 
space structure that makes {ej} into an orthonormal basis. (To make sense out of the 
infinite product, we restrict £ to be "primitive", i.e. , || \ v = 1 for all but finitely 
many is.) 

To prove the Lemma, suppose that 

(*) g^ing E Q 

with n G N(A) and compact modulo Z(h). Write g = n\ok, so that (*) implies 

a~ x n{ x ^inn\a G KVtK. 
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Because the map g i— > p(g)e is continuous, and p is trivial on Z(A), it follows that 



p(a n l ^nriia) ■ eo 



< e 



2cZ ( ) 



for some d > 0. So suppose that 7 does not belong to B(F). Then by Bruhat's 
decomposition, 

7 = 6 wn 

for some b Q in -B(F) and n in N(F). But for any 6 G 5(A) and g G C7(A), our 
assumptions on p and e imply 

\\p(gb)-e \\=e aH ^ || pQ?) • e || . 

Thus (with 7 = b wn ) we compute that 



p((a n 1 b w)(n nnia)) ■ e 



^aH(a) 



p(a n x b ) ■ e 2 



e 2aH(a) || p(n ^ . ^ 



> e 2aH{a) 



e 2 



3 2aif( S ) 



(since i?(<?) = H{a), p(a) e 2 = a a e 2 , and || p(n) e 2 || > || e 2 || for any n in N(A)). 
Thus the Lemma is proved for d n > d . □ 



3. The Definition of K^(x, y). For the (unique non-trivial) unipotent class in 
Z(F)\G(F), we saw in Lecture I that K (x,y) is not integrable over the diagonal. 
A somewhat less trivial computation (see Lecture III) shows that the hyperbolic 
K (x, y) are also not integrable. So let us finally introduce the required modifications 
of these kernels. 

For P = B, let tb denote the characteristic function of the positive Weyl chamber 
QB + = { (ri, r 2 ) G ob = a M \ n - r 2 > }, 

i.e. , the cone in R 2 where a(ri,r 2 ) = r\ — r 2 is positive. Then define, for any 061) 
and T = (Ti,T 2 ) in a B + , 

(3.1) kj(xj) = K (x,x) - J2 K B:0 (5x,5x) r B (H(5x)-T) 

SeB(F)\G(F) 

where 




This is Arthur's modified kernel kj(x, f). 

Note that if is elliptic, then fl M(F) = 0. In this case K B ^ = 0, and 



kj(xj) = K (x,x). 
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In general, the term subtracted in 3.1 from K (x,x) is the "correction term at oo" 
which ends up making kf(x, /) integrable. Before proving this is so, let us note the in- 
ductive nature of the definition of kj (thereby making more plausible the appearance 
of K B , ). Set 



(3.2) K B (x,y)=J2 K B,o(x,y) 



dn. 



■yeZ(F)\M(F) JN ^ 



Then K B (x,y) is just the kernel of R(f) acting in L 2 (Z(a)N(a)B(F)\G(A)) in place 
of L 2 (Z(a)G(F)\G(a). Indeed for <p in L 2 (Z(a)N(a)B(F)\G{A)), ' 



R(fMx)= f f f(x- 1 yMy)dy 

JZ A)\G A) 



IZ(A)\G{A) 

(3.3) = / ( E / /(z-Vy)^W)*/, 

JZ(A)N(A)B(F)\G(A) \ Z{F ^ M{F) Jn(A) J 

as claimed. In general, for an arbitrary G, the definition of kj(x, f) takes into account 
all the proper parabolics P of G, subtracting off from K (x, x) not just one term (for 
the minimal parabolic B), but the sum 

(3.4) j2 { _ 1)di mA P /z K P , (5x,5x)? P (H(5x)-T), 

PCG 8eP(F)\G(F) 



where 



K P {x,y) = K P>0 (x,y) 

is the kernel of R P (f) in L 2 ( y Z(A)Np( y A)M P (F)\G(A)). (Here A P is the maximal 
split torus in Mp, the Levi component of P = MpNp, and Rp is the right regular 
representation of G(A) in L 2 (Z(a)A^ p (a)M p (F)\C7(a)).) 

Remark. Observe that 

kj(x,f) = K (x,x). 

for i in a compact (modulo Z(A)) set Q (how large depends on T). Indeed, for x 
in an appropriate such set Q, tb(H(5x) — T) is identically zero*. Thus x is indeed 
a modification of K (x, x) only "near infinity" (and in higher rank, more than one 
parabolic is needed to effect these modifications near infinity ... ). 



* Proof: Fix c and T such that a(T) > c. Fact 1 of Section 2 implies that in 6 C , tb (H(Sx)-T) ^ 
only for finitely many classes Si in B(F)\G(F). Then for each such class, TB(H(Six) — T) = on 
the compact set Qf in & c where c < H(Six) < a(T). Thus tb(H(Sx) — T) is identically zero on 

£i T = n? =1 nf. 
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4. The Integrability of kj(x, f). Henceforth, let us assume that o is unipotent or 
hyperbolic. To prove that kj(x,f) is integrable over Z(a)G(F)\G(a), it suffices to 
show that it is integrable over a Siegel domain & c . So pick any T e ag + , and write 

6 C = B T U tf T , 

where 

i3 T = {(?G6 c | aH(g)>a{T)>0}, 

and St is its complement in 6 C . Clearly fcjf (x, f) is integrable over Bt, since 

c < < a(T). 

Thus it suffices to prove that kj(x,f) is integrable over Bt- (N.B. In general, the 
partition of a Siegel domain requires more non-trivial "reduction theory" than is 
evident in the case of GL(2); this is what leads to the "geometric" combinatorial 
lemmas in §6 of [A2].) 

Lemma 4.1. There exists a constant Co such that aH(wn) < c for all n e N(A); 
moreover, if 7 in G(F) satisfies the inequality 

aH(jx) > Co 

for some x in G(A) with aH(x) > c , it must follow that 7 e B(F). 

As the proof is similar to that of the crucial Lemma 2.1 in §2 of this Lecture, we 
leave the details to the reader (see [Al, pp 334-335]). 

Now fix T in ob + such that a(T) > cq. Then for x in Bt, Lemma 4.1 implies that 

r B (H(5x)-T)^0 

(if and) only if S e B(F). Thus (for such x), 

E /(*~ V) = E E Kx-H-^Sxfowsx) - t), 

7Go 8eB(F)\G(F) 7£" 

and hence 

kj(x,f)= E {E/(^"V^) 

SeB(F)\G(F) Ueo 

-/ E f(x^ 1 5^ 1 fin5x) dn\rB(H(Sx) — T) 
Jn ^ M eM(F)no J 

Next fix T such that a(T) > max{c , dn f }, with as in Lemma 2.1. Then by 
Lemma 2.1, we may replace the sum here over 7 e by a sum over 

B(F) n = JV(F) • M (F) n 

(This last equality holds since the class of 7 in B(F) is determined by its semi-simple 
part, i.e. its M(F)-part.) Thus we get 
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K (x, /) 



E 

S&B(F)\G{F) /j,eM(F) 



(4.1) 



E { E fix-^usx) 

vl(F)C\o lueN(F) 

-[ f(x- l 5- 1 fin5x)dn\f B (H(5x) -T) 

JN(A) ! 



EE E F Sx Ju)r B (H(Sx)-T) 

S V veN(F)-{l} 



where 



Fx,^{n) = f(x 1 finx) 
defines a Schwartz-Bruhat function on JV(A) ~ A with Fourier transform 



AT (A) 



/(x 1 /xnx) , 0Ar(ni/) (in 



(Here V'iv((oi)) = VK 2 -) f° r some fixed non-trivial character on F\A, and we have 
applied the Poisson summation formula to F$ X)M .) Finally we are ready to prove 

Proposition 4.2. For any o E O, kj(x,f) is absolutely integrable over 

Z(A)G(F)\G(A). 

Proof. We may assume o is not elliptic. By the above arguments, it suffices to check 
the integrability over Bt, with a(T) > max{c ,<in / } as above. As in that domain 
formula (4.1) holds, we conclude 



(4.2) 



dx < 



J\kJ(x,f) 

I EE 

JZ(K)B(F)\G(A) ^ M{F)noueN(F y {1} 



F x ,n(v) f B (H(x) -T)dx. 
Note that for any u = (jjj) in M(F)C\o, v = (J™) in N(F)-{1}, and x = znh t mk, 

£» = / a /(^(;;)(;?)^w^ 



e 2 ^(e 2 V) 



with fc* = (h t nh t )mk. (Here e v 



1 C M W 

o i y ' 



and we confuse N(F) with F.) Note 



also that the variable x in (4.2) runs through a Siegel set. Thus k* runs through a 
compact set, and we have, for any N > 

E^(e 2 M<Me~ 2 r 
Applying Iwasawa's decomposition (2.3) to (4.2), we then conclude that 

/ kj(x,f) dx<C [ [ I ( r e~ 2tN dt)dkd x a dn< oo 

J JN(F)\N(A) JF^W JK\Ja(T) ) 



as required. □ 
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Summing up, we have 
Theorem 4.3. For a(T) sufficiently large, 




^ JZ(A)G(F)\G(A) 



kj(x, f) dx < oo. □ 



Hence the left side J2 a <=o Jj(f) °f (1) °f Lecture I is defined and absolutely con- 
vergent. 

Indeed, it suffices to remark that for a given / e C£°(Z(a)\G(a)), kj(x,f) is 
identically zero except for finitely many hyperbolic o. This is clear from the defining 



the fixed compact mod Z(A) set Qf for only finitely many a; to see this, observe 
that conjugating by 5x preserves eigenvalues. Thus the sum over o in O in Theorem 
4.3 is infinite only over the elliptic classes, where we already know the relevant sum 
J \K (x, f) \ dx is finite (see the Remark after the proof of Proposition 2.2). 

Concluding Remarks, (a) The terms Jj(f) are indeed generalizations of the 
orbital integrals appearing on the geometric side of Selberg's compact quotient trace 
formula. Indeed, for o elliptic (which is automatic for G anisotropic), Jj(f) reduces 
to a multiple of the usual orbital integral f G (a)\g(a) /( rc ~ 1 7 ;r ) dx. 

We have seen — for GL{2) — that the kernel K (x, x) is already integrable on 6 C if o is 
an elliptic class (as opposed to hyperbolic or unipotent). For general G, the proper 
distinction is between those o which never intersect any proper parabolic subgroup 
subgroup P of G and those that do. Indeed Arthur does not talk about "elliptic" o 
at all, but rather defines a modified kernel 



for any o, and proves that kj(x,f) is integrable. But of course — as we have al- 
ready observed — kj (x, f) = K (x,x) precisely when o is elliptic in this sense (that it 
intersects no proper P . . . ). 
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Lecture III. Arthur's Modified Kernels II: The Spectral Terms 



In order to describe the right hand side of Arthur's trace formula, we need first 
to recall the spectral expansion of the kernel 

K f(x,y) = K x( x >y)- 

1. The Spectral Kernels K x . Henceforth, by a "Levi subgroup M of G" we 
understand the Levi component of a parabolic subgroup P of G; for any such M, 
we have 

M 1 (A) = {m E M(A): |A(m)| A = 1 VA e X{M) F }. 
In general, the decomposition of 

L 2 (Z(A)G(F)\G(A)) 

into right G(A)-invariant subspaces is determined by spectral data \ = {iMi r )} G 
X, where the pair (M, r) consists of a Levi subgroup M of G and a cuspidal repre- 
sentation r of Z(A)\M(A); the class {(M, r)} derives from the equivalence relation 

(M,r) - (M',r) 

if and only if M is conjugate to M' by a Weyl group element w, and r' = r w on 
Z(A)\M 1 (A). Then to each such cuspidal datum \ — {(-^i r )} is associated a 
subspace Z£, such that 

L 2 (Z(A)G(^)\G(A))=0^, 

x 

and the corresponding decomposition of kernels 

x 

is explicitly describable in terms of Eisenstein series. 

For G = GL(2), cuspidal data comes in two possible forms: 

Case (i): M = G,r = a cuspidal representation 7r of Z(A)\G(A). In this case, 
L 2 X = Lk, the irreducible subrepresentation of L 2 ) (Z(A)G(F)\G(A)) realizing w, 
and 

(1.1) K x (x,y) = K„{x,y) = ]T R(f)4>(x)W) 

{4>}— o.n. basis of 
17 
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Case (ii): M = { ( "q a 2 ) } ' r ( a 2 ) = M fe) ™^ M ° character °f FX \ Al 
(identified with the pair (M,p^ x ). In this case, 

/oo 
p(p, it)(f)(f)^,it, p) E(y, it, p) dt. 
'OO 

iff} 

where {(f)^} runs through an ON basis for the induced representation space 

Ind B(A)A* (^j =p{fi,o), 
consisting of functions (f> : G(A) — > C such that 

for all ^ in B(A), and J K \<f)(k)\ 2 dk < oo; p(p, s) denotes the induced 

representation space Indgj^j p (02) I 02 12' and ^(^' s ' m) is the Eisenstein series 

7£S(F)\G(F) 

attached to the function 
in p(p,s). 

Remarks, (i) The series defining E(g, <j)^, s, p) converges only for Re(s) > 1/2, 
but E(g) itself has a meromorphic continuation to all of C, with a possible pole at 
s = 1/2. These and other facts from the theory of Eisenstein series we shall simply 
assume, referring the reader to [GJ] for more details. 

(ii) If p? = 1, then (and only then) K x (x,y) has an additional "discrete" term, 
namely 

(1.3) (R(f)p)(x)JW) = [ f(g)p(dctg)dg p(detx)p(dety). 

JZ(A)\G(A) 

(this comes from the residue of the corresponding Eisenstein series at s = 1/2 ... ). 

(iii) Above, K x (x,y) represents the kernel of R(f) acting in the space L 2 ,, which 
(for any \ = {(Af,M)}) consists of all if(g) which are orthogonal to the space of 
cusp forms Lq(Z(A)G(F)\G(A)), and such that for almost every x in G(A), the 
projection of the function 

ip%(m) = / (p(nmx)dn, 

JN(F)\N(A) 
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onto the space L 2 (Z(A)M(F)\M 1 (A)) transforms under Z(A)\M 1 (A) as a sum or 
representations r, where (M, r) is a pair in \- 

The beauty of this description is that it generalizes to a general reductive group 
G, as we shall now describe. 

For any fixed parabolic P = M P N P in a reductive G, one may describe a de- 
composition 

(1.4) L 2 (Z(A)N P (A)M P (F)\G(A)) = L 2 (P) = L 2 (P) X , 

where L 2 (P) X consists of those <p in L 2 (Z{A)N P (A)M P (F )\G(A)) with the fol- 
lowing property: for each standard parabolic subgroup B of G, with B C P, and 
almost every x in G(A), the projection of the function 

m — > 45b.iM = / (p(nmx)dn 

Jn b (F)\N b (A) 

onto the space of cusp forms in L 2 (Z(A)Mb{F)\M b (A)) transforms under M B (A) 
as a sum of representations rs, in which (Mg, rs) is a pair in If there is no such 
pair in x, fB.x will be orthogonal to the space of cusp forms on Z(A)Mb(F)\M^(A) 
(as happens above for G = GL(2), P = G = B, and \ = {( M , A»)})- 

Remark. For any fixed x in X, let P x be the set of standard parabolics B, where 
(Mb, tb) belongs to \- Then from the theory of Eisenstein series it follows 

L 2 (Z(A)N P (A)M P (F)\G(A)) X 

will be zero unless there is a group in P x contained in P. 
Let us return now to the decomposition 

(1.4) L 2 (Z(A)N P (A)M P (F)\G(Aj) = L 2 (P) = L 2 (P) X , 

described above. If we let K PjX (x,y) denote the integral kernel of the restriction 
of R P (.f) to L 2 (P) X , then one can still — in general — write down a formula for 
K P ^ x (x,y) in terms of Eisenstein series. Clearly 

(1-5) J2K P , (x,y) = J2K PiX (x,y), 

as each side equals the integral kernel of Rp(f) (see §3 of Lecture II, where we 
computed the "geometric" expression for the kernel K P , for G = GL(2) and P = 
Borcl). 

Summing Up. We have 

Kf(x,y) = ^K x {x,y) 
and (for any parabolic P of G), 

K P {x,y) = Y, R p,x( x >y)- 

xex 
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Concluding Remarks, (a) When P = G, 

Kp(x, y) = K G (x, y) = K(x, y) 

and 

Kp, x (x,y) = K x (x,y). 

Thus the first spectral expansion is a special case of the second. 

(b) For G = GL(2), and P = B (the Borel), we have L 2 (P) X = if \ = 
(G,7r). On the other hand, if \ = {(Wp,^)}, then L 2 (P) X just consists of ip 
in L 2 (Z(A)N P (A)M P (F)\G(A), which under left action by M 1 (A) transforms ac- 
cording to a sum of ^ and /z -1 . The corresponding kernel KB, x (x,y) is given by 
the formula 

(1.6) K B , x (x,y) = K B (x,my)(ii(m) + fi w (m))dm. 

J Z(A)M(F)\M 1 (A) 

This is easily checked by computing the composition of R(f) with the projection 
operator P x defined on L 2 (Z(A)N(A)M(F)\G(A)) by 

{ P xV){y) = \ (p(my)(n(m) + fi w {m)) dm. 

J Z(h)M(F)\M 1 (A) 

Moreover, an analogue of (1.6) (without the subscript B) holds for the kernel 
K x (x,y). 

(c) The series 

E K x( x ' y) = /Z K G,x( x > v) 

and 

^K PtX {x,y) 

converge absolutely (to K(x,y) and Kp(x,y) respectively). 

(d) In general, for any G and \ = {(M, r)} 

(1.7) Y,K G>x (x,y) = 

E 7Ta~~\ E / E ^> A )(/)^' A)S(2/,0,A) dA, 

where: the sum over P is over "associated" parabolics in G, n(Ap) is the number 
of chambers in ap, a* P = X(M P ) ®R is dual to ap = Rom(X(M P ) F , WL), p(a,X) is 
the right regular representation in Ind^^j a <g) e x ^ Hp ^\ <t> runs through a suitable 
if -finite) basis for p(a, A), and — critically, a runs through (classes of) irreducible 
unitary representations of M(A) such that functions in Indp|*j a belong to L 2 (P) X . 
Note that for \ = {(G, 7r)}, (1.3) just reduces to 



4> 
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since L 2 (P) X = unless P = G. Thus our original formula for K 7r (x,y) is indeed 
consistent with the statement that any Kp a (x, y) is expressible in terms of Eisen- 
stein series (but now the Eisenstein series in question belong to <f) in L n , viewed 
as induced from themselves on P = G). At the same time, formula (1.7) is also 
general enough to include the extra term R(f)(fi)(x)n(y) in case G = GL(2), and 
X = {(M,n)} with n — indeed, the parabolic P = B in (1.7) contributes the 
leading term (1.2) to K x (x,y), while the parabolic P = G contributes the crucial 
additional term (1.3). 



2. Modified Kernels and Truncated Kernels. Recall the identity 

introduced towards the end of the last section. By complete analogy with the 
geometric side (compare formulas (3.1) and (3.4) of Lecture II), Arthur defines the 
modified spectral kernel functions 

(2.1) k T x {x, f) = (-l) di ^ A -^ Yl K p^x, Sx)t p (Hp(Sx) - T), 

PCG 8eP(F)\G(F) 

which in the case of GL(2) look like 

kT(x,f)=K x (x,x)- ]T K BtX (5x,6x)r B (H B (5x)-T). 

5eB(F)\G(F) 

Because of the identity (1.5), one clearly has 

(2.2) ^fcn*,/) = E fc x (*'/)' 

i.e., the modification of the geometric expression for the kernel of R(f) is equal to 
the modification of the spectral expression for this kernel. 

Despite its simplicity, this identity (2.2) is the starting point of Arthur's trace 
formula in the case of non-compact quotient. Of course, we emphasize (again!) 
that neither side of (2.2) represents (in general) the kernel of R(f). However, we 
have already seen in Lecture II that each of the functions kj(x,f) is (absolutely) 
integrable over Z(A)G(F)\G(A), and that the sum of the resulting distributions 

Jj(.f)= [ kj(xj)dx 

J Z(A)G(F)\G(A) 

also converges absolutely. Thus by (2.2), we have shown that 

(2.3) J2 J o(f)=l $>£(*,/) cfe, 

JZ(A)G(F)\G(A) x 



22 



STEVE GELBART 



i.e., to obtain the first form of Arthur's trace formula, we need "only" show that the 
integral (on the right side of (2.3) may be taken inside the sum over \\ m particular, 
each k x (x, /) is integrable, and the sum of the resulting distributions 

(2.4) ■£(/) = / ^(xj)dx 

converges absolutely to the left side of (2.3). 

To prove the absolute integrability of each k x (x,f), and of the sum of such 
kernels, it turns out to be necessary to introduce a truncation operator on 
G(F)\G(A). Given T in a + as before, the truncation of a continuous function 
<p(x) on Z(A)G(F)\G(A) is the function 

(2.5) A T p(x)=<p(x)~ <Pn(5x)t b (H(5x)-T), 

SeB(F)\G(F) 



where 



Pn{5x) = / ip(nx)dn 

J N (F )\N (A) 



denotes the constant term of ip. (Again, for x e 6 C , the sum over 5 is finite, by 
Fact 1 of Lecture II . . . ) In general, for arbitrary reductive G, 

(2.6) A T p(x) = ]T (-l)dim(AWZ) V N P (Sx)fp(H(Sx)-T). 

PCG 5eP(F)\G(F) 

Note that if ip(x) is cuspidal, i.e., ipN P — for every Np, then 

A. T ip = ip. 

Of course, for a general automorphic ip, this is no longer true; however, the whole 
idea of truncation is that A T ip still equals tp on a large part of & c (how large 
depends on T), and suitably modifies (p "near infinity" so as to be integrable over 
all of & c - In particular, for given "large" T, there is a compact (modZ(A)) set fi 
such that 

A T <p = ip on O. 

(Both this statement — and its proof — resembles its analogue for the modified ker- 
nels kj(x, /); see Remark 3.5 of Lecture II.) 

Now for any \ m let ^K x {x, x) denote the function obtained by truncating 
the function 

K Ga (x,y) = K x (x,y) 

with respect to the second variable and then setting y — x. The strategy Arthur 
follows in [A3] to prove that 



(2.7) J %(x, f^j dx = Y^J k T x {x, f) dx 
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is the following: Using properties of A T , he first shows that 

(2.8) / E A^k^x.x) dx — E / A% k x (x, x) dx 

J x x J 

with 



then he shows that for sufficiently large T, 

(2.9) fj2\^ k x(x,x)-k^(xj)\ dx = 0. 

J x 

From these facts it follows that J2 Jj{f) converges absolutely to Jj(/), where 
for sufficiently large T, J x (f) is given by either (absolutely convergent) integral 



J k^(x,f)dx or J AjK x (x, x) dx. 



In particular, the first form of Arthur's trace formula is established. 
Concluding Remarks. Set 

K cusp (x,y) = E K x( x >v) 
xeX(G) 

where 3E(G) denotes the set of cuspidal data {(M, a)} with M = G, and R be the 
regular representation of G(A) restricted to 

L l = (B L x- 

xex(G) 

Then for any T, 

(2.10) Af if cusp (x, y) = K cusp (x, y), 

this kernel is integrable over the diagonal, and equals to Ro{f). Thus Arthur's first 
form of the trace formula gives the more familiar trace formula 

(2-11) tri? (/) = E J ° T (/)- E J x(/)- 

oeo x\x(G) 

Here only the proof of (2.10) is elementary; it results from the observation that 
y — ► K cusp (x, y) is cuspidal (see the beginning of §4 below). The remaining facts 
are discussed in [Al], at least for the case of rank 1. To prove that tri? (/) exists 
and equals AT cusp (x, x) dx one uses arguments similar to those used in the (earlier) 
proof that / kj(x, f)dx < oo. 
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3. Proof of (2.8) and the Absolute Convergence of / A 2 K x (x,y) dx < 

x J 

oo. When Arthur described the spectral expansion 
(3.1) K{x,y) = Y,K x {x,y). 

in [A2] , he stressed its absolute convergence by way of the following: 

Lemma 3.2. (see Lemma 4.4 of [A2]) There exists an N such that for any differ- 
ential operators D\ and D 2 in 



^T]Tn(A)- 1 /' \Y / D 1 E(x,p(a,X)(f)(cP x ),X)D 2 E(y,<l )x ,X) 



d\ 



iV ii .11 JV 



KciDuDJWxrWy 



Here ||x|| denotes a "norm" or "height" function on G(A), which we may (for 
GL(2)) take to be ]X IWI, witn \\ x v\\ = sup{\(x v )ij\ v , K^ 1 )^}- This norm 
satisfies the properties ||x|| = ||#i#2|| < ||a;i||||a;2||, and \\x > \\ci, for some 

ci; moreover, it is commensurable with e a ^ H ^\ at least for x in a fixed Siegel 
domain. 

The above Lemma is essentially part of Langlands' theory of Eisenstein series; 
from it, we get 

(3.3) \D liX D 2 , y K x (x, y)\ < c(D 1 ,D 2 )\\x\\ N \\y\\ N , 

with a similar identity for K(x, y) itself . (Here D 2tV indicates that D 2 is operating 
only in the y variable, etc..) These identities say that K(x,y) and K x (x,y) are 
"slowly increasing" in each variable. It is by way of these facts that the finiteness 
of 



]T J \KlK{x,x)\dx 



is established. 



Lemma 3.4. (see Lemma 1.4 of [A3]) The truncation operator transforms "slowly 
increasing functions" ip into "rapidly decreasing" ones; more precisely, suppose ip 
in C°°(Z(A)G(F)\G(A)) is such that for some N (the "degree of slow increase"), 

\Dip(x)\ < C D \\x\\ N for all x e & c 

and differential operators D. Then for any M 

\k T y{x)\ < Cm\\x\\~ m 



for x in & c (and some Cm)- 
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Remarks, (a) Since we are not proving Lemma (3.5), let us at least explain why 
it is plausible for G — GL(2). In this case, it is clear from Lemma 2 of Lecture II, 
that for x "near infinity", in particular, such that a(H(x) > a(T) > Co, 

k T tp(x) = tp(x) - ip N {x), 

Thus A T removes the constant term form cf>, making it rapidly decreasing at oo 
(think roughly of a classical modular form for SL 2 (Z) with vanishing constant term 
... ). 

(b) What Lemma 1.4 of [A3] actually asserts is that for any pair of positive integers 
M and N, we can choose D\, . . . D n such that for any x in 6 C , 



(3.5) \A T ^(x)\ < (sup \DM*')\ ■ \\x'\r N ) 



\x'\\- M 



But if ip happens to be slowly increasing of degree N, along with its derivatives, then 
the expression in parentheses is bounded, and so Lemma 3.4 immediately follows. 
It is this latter form of the Lemma — that is, (3.5) — that we shall now apply. 

Proposition 3.5. The truncated kernels K^K(x 1 x) and K^K x (x,x) are abso- 
lutely integrable, and 

/ A2K(x 1 x)dx = ^2 / ^2 K x {x,x) dx. 

Proof. Let us show that for any M, there exists constants c' and c x such that for 
x in S c , 

A^K{x,x)<c'\\x\\ N - M and A$K x (x, x) < c' x \\x\\ N - M . 

Since ||a;|| x e"^' 1 " on any Siegel domain, this will suffice, by Iwasawa's decom- 
position, to prove that 

J |A^if(a;, x)\ dx < oo, 

and hence by (Lebesgue's) dominated convergence theorem, that 

/ A2 K(x,x)dx = ^2 / AjfisT(x, x) dx, 
J x J 

an absolutely convergent sum. 

Applying Lemma 3.4 (in the form of identity (3.5)) to the second variable of 
K(x,y) yields 

|A^,y)|<C7 ||y|r M .sup sup (\D t . y K(x,y')\\\y'\\- N ), 

y' D 1: ...D n 

which by (3.3) is dominated by 

Co||y|r M sup(C7(l,A)||xri| y 1| w )||yir w = Co|| y |r w sup(C(l,A))||x|| Ar . 

y' ,i i 

So now setting x = y gives the desired conclusions (for K(x,x)), and for K x (x,x) 
one argues similarly. 
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4. Proof of (2.9) Relating / A%K x (x, x) dx to J k x (x,t)dx. Note first that 
when x = (G, 7r), there is nothing to prove. Indeed, if is the kernel of R(f) 
restricted to any cuspidal subspace L„ of L 2 (Z(A)G(F)\G(A)), then 

y — ► K n (x,y) 

is itself cuspidal: for any ip* in (Lq)- 1 , 



K 7I (x,y)(f*(y) dy = R(f) ° (Projection onto C L ) = 0. 

Thus it follows 'K v (x,y) = K w (x,y). On the other hand, as recalled in §1, 
Kp a (x,y) = for all proper P in G (when \ = (G, n)). Thus we also have 
(from the definition (2.1) of k x (x,f)) that k x (x,f) = Kq iX (x,x) = K 7T (x,x), i.e., 
A2 Kt T (x,x) = k x (x, /), and there is nothing to prove. 

Henceforth, we shall fix G = GL(2), and assume \ = {(M,<r)} with M — 

and a ( ^ ? ) = A*( a /^) ( or rather its class, relative to the relation 



b J J ' V b , 

/i ~ /i -1 in (F a: \A 1 ) A ). Adapting the general arguments of §2 of [A3] to this case, 
we shall now (at least) prove that 



/ 



A\K x {x,x) - k x (x, f)\ dx = 



for sufficiently large T. 
By definition, 



{AiK x {x,x)-k x (x,f)) = 

V / K x (x,nSx)f B (H B (Sx) -T)dn 

5eB(F)\G(F) J 'N(F)\N(A) 



K BtX (6x, Sx)t b (H b (Sx) - T). 



5eB(F)\G(F) 



Note that K x (x,n5x) = K x (Sx,nSx) (since E(x,4>) is left G(F)-invariant). There- 
fore the above difference equals 

(4.1) V t b (H b (5x)-T)\ [ K x (Sx,nSx)dn- K Bx (5x,5x)\ . 

SeB(F)\G(F) {Jn(F)\N(A) J 

Next recall our expression (1.6) for K B x (and Ka, x ) in the Concluding Re- 
mark (b) of §1. Applying it to (4.1), with y = 5x, we get that the expression in 
parenthesis equals 

/ K x (y,ny)dn- K B>x (y,y) = 

Jn(f)\n(a) 



I \I 

J Z(A)M(F)\M 1 (A) [JN 



(K(y, nmy) - K B (y, my)) dn 

1 Z(A)M(F)\M 1 (A) I J N(F)\N(A) 



X 



//(to) + n 1 (m) dm. 
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Applying the definition of K and K B , we find, in turn, that the last expression in 
parentheses equals 

/ ^ f(y~ 1 ~/nmy)dn— / ^ f(y~ 1 ~/nmy)dn 

Jn(f)\n(A) ieZ(F)XG{F) Jn(A) ^ z(f) \ m(f) 

= I I fiy^jnmy)- ^ f{y~ 1 'ynmy) dn 

J N(A) y 7£jV (F)Z(F)\G(F) JN{A) aiGZ ( F )\ M ( F ) J 

Now apply Bruhat's decomposition 

Z(F)N(F)\G(F) = Z(F)\M(F) U (Z(F)\M(F))wN(F) 
to the first summation above. The result is that the entire last expression equals 

/ J]] f(y~ 1 fiwvnmy)dn. 

JN(A) veN ( F ) fj, e z{F)\M(F) 

Using this, it remains to show that 
J \A^K x (x,x) - k^(x,f)\dx 



< E r B {H{5x)-T) 

J Z(A)G(F)\G(A) SeB{F)XG(F) 



/ / Z^Z^f( x 1 ^ w vn m ^x)(f J -( m ) + I 1 1 (m)dmdx- 

J Z(A)M(F)\M 1 (A) Jn(A) 

[ r B (H(y)-T)x 

JZ(A)B(F )\G(A) 

JJ^2^2f(y~ 1 6nwvnmy)(fi(m) + ^~ 1 (m)^ dmdy 



I Z(A)B(F)\G(A) 

(4.2) 



= for sufficiently large T. 

So suppose y is such that this expression (4.2) is not zero. Then our hypothesis 
on / implies 

y^ 1 [iw v nmy 

belongs to the compact (modZ(A)) subset ftf of G(A). Writing out the Iwasawa 
decomposition n\ak for y implies 

g = a~ 1 n^ L 1 vw finmnia G Cl'f 

with Q'j the compact ( mod Z(A)) set KQfK, i.e., a{H{g)) remains bounded 
(from below and above). 
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On the other hand, write 



g = a 1 n 1 1 is w /inmnia 



= (n^V) a a- L )w^ma(n m m) a 
= n a wn 

where n*,n' e N(A) and a* = a- 1 a w (^m) m . Then 

c < H(g) = H(a*) + H{wri) = -2H{a) + H{wn') 

implies 

-2H(a) >c-H(wn'), 

i.e. 

(4.3) H{ a)<^l- C -<^ = d 

(using Lemma 4.1 of Lecture II). 

To complete the proof, take T "sufficiently large" to mean that a(T) > c'. Then 
the condition fs(if(y) — T) = f B (H(a) — T) ^ will be incompatible with (4.3), 
i.e. (4.2) will be identically zero, as claimed. 



5. An Alternate Strategy?. In any treatment of the trace formula prior to the 
1978 paper [A2] (see [Al]), no modified kernels appear. For example, in [GJ] one 
simply and (formally) truncates both sides of the identity 

K(x,y) = ^/(x- 1 7 y) = ^K x (x,y) 

7 

to obtain a trace formula (for GL(2)) of the form 

(5.1) / A^K(x,x)dx = I AlK x (x,x)dx. 

J x J 

Our purpose now is to explain where the modified kernels are actually hiding in(5.1) 
and why they can not be avoided in the general theory. 

Using the crucial Lemma 2.1 of Lecture II, one can easily prove (see p. 235 of 
[GJ]): 

Lemma 5.2. For T sufficiently large {i.e. a(T) larger than the constant da f 
described in Lemma 2.1 of Lecture II), 

A?K(x,x)= K K X J) + E Kj(xJ)^K T (xJ) 

elliptic unipotent 

or 

hyperbolic 
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Note that Lemma (5.2) implies that the truncated geometric kernel used in (5.1) 
is already a sum of the modified geometric kernels. Thus an alternate strategy for 
obtaining the trace formula 

(5-3) £^(/) = £^(/) 

« X 

suggests itself: since we also have 

A%K(x, z) = £ ^K x (x, x). 

x 

(the spectral analogue of the geometric formula for ti^K(x, x) afforded by Lemma 
(5.2)) , why not just integrate each of these expressions term by term to obtain the 
trace formula identity (5.3) with 

Jj(/) = / A^K x (x,x)dx? 

JZ(K)G(F )\G(A) 

Indeed, by Lecture II we know the left hand side of (5.3) is nicely behaved, and by 
the first three sections of this lecture, we know the right hand side converges too. 

This strategy was in fact carried out in [GJ] for GL(2), where it turns out that 
terms on either side of (5.3) can be computed completely explicitly, as (linear) 
polynomials in T (see §6 of [GJ] and §§1 and 2 of the next lecture). Then when 
these expressions are plugged into the formula 

trfl (/) = £ J ° T Cf)- £ 4(f), 

the terms depending on T cancel out (as they must, since the left hand side is 
independent of T). Thus we are left with our sought-after formula for tr Ro(f). 

What goes wrong with this strategy in general? Well, for an arbitrary reductive 
group G, it would be hopeless to try and compute each term Jj(f) and J x {f) 
completely explicitly; fortunately, however, it is also unnecessary. What Arthur's 
more indirect strategy (involving modified spectral as well as geometric kernels) 
allows one to prove is that J x (f) arid Jj {f) are a priori polynomials in T (for 
sufficiently large a(T)). Thus we can concern ourselves only with the constant 
terms of these polynomials (since our goal is to make (5.3) explicit, and only terms 
not depending on T will survive on the right-hand side ... ). These constant terms 
((J (/) = </„(/) and (J x (f) = J x (f)) are by no means trivial to compute, but they 
do turn out to be feasible to treat in Arthur's general theory, as we shall soon see. 
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Lecture IV. More Explicit Forms of the Trace Formula 



In the first few lectures, we have described the first form of Arthur's trace for- 
mula, which in its symmetric formulation reads 

(*) E J o T (/) = E J x T (/); 

X 

more suggestively, 

(**) tr( J R (/)) = E J o T (/)- E J x(f)- 

o x ex\x(G) 

After developing this formula in [A2] and [A3], Arthur devoted the next ten years 
(and several hundred pages of work) to rewriting this formula in a more explicit 
form, suitable for applications. In particular, in order to apply (**) effectively to the 
theory of automorphic forms, it seems essential to know a lot about the distributions 
Jj(f) and J%(f)- How do they depend on T? How can they be expressed explicitly 
in terms of weighted orbital integrals or weighted characters? How can they 
be factored into local distributions on the groups G v ? 

Our purpose in the remainder of these lectures is to explain some of Arthur's 
answers to these questions. But to make this task easier, we proceed in stages. 

One of Arthur's first general results is that each distribution Jj(f) or (Jj(/)) is 
a polynomial in T, of degree at most dim(Ap / Z) . Another is that each "unram- 
ified" Jj{f) (or Jj(/)) has a fairly explicit representation as a weighted orbital 
integral (or character). To motivate these and other general results, we shall first 
recall the familiar example of GL(2). 

1. Explicit Results for GL(2) (Geometric Side). As we already observed in 
general in Lecture II, the elliptic terms Jj {f) are independent of T, and simply 
equal ordinary (factorizable) orbital integrals (see Proposition 2.5 of Lecture II). 
On the other hand, the hyperbolic and unipotent terms are non-trivial (linear) 
polynomials in T, explicitly described in the propositions below. 

Proposition 1.1. If o denotes a hyperbolic class with representative 7, then for 
a(T) sufficiently large, 

Jj(f) = (Ti - T 2 )m(F*\A 1 ) / /Or" V) dx 

ilf(A)\G(A) 

- ^-m(F*\A 1 ) [ f{x- 1 1 x)a(H(wx) + H{x)) dx 

2 Jm(A)\g(A) 
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Remarks. (1) The proof of this proposition is provided by the formal computa- 
tions of [GJ, pp. 237-238] now justified by the already established a priori absolute 
convergence of J kj(x, f) dx. The assumption a(T) » is needed so as to be able to 
apply Lemma 4.1 of Lecture II (which in the case of GL(2) just requires a(T) > 0). 

(2) In §3 of Lecture I, we saw that the unmodified hyperbolic kernel K was not 
integrable because of the appearance of the (divergent) volume of F X \A X ; Propo- 
sition 1.1 shows that the integral of the modified kernel indeed "truncates" this 
volume to a finite interval of length (7\ — T 2 ), which appears as a linear term in T 
with coefficient f f(x~ 1 jx) dx; however, the constant term of this polynomial no 
longer involves an invariant orbital integral (because of the presence of the "weight 
function" a(H(wx)) +H(x)). 

Proposition 1.2. For the unipotent class o, and a(T) sufficiently large, 

J (f)=m(Z(A)G(F)\G(A))f(l) 

+ f.p. (C(^ S )) + (T 1 -T 2 )to(^*\A 1 ) f F(y)dy 

s=l J A 

where 

F(y)= f fik-^l^tydk ismS(A), 

and f. p. s=1 (((F, s)) denotes the "finite part" at s = 1 of the (meromorphic) zeta- 
function 

((F,s) = [ F(a)\a\ s d x a 

of Tate. 

Remarks. (1) Iwasawa's decomposition implies (formally) that 

C(F,s) (ats = l) = ^^/(fc- 1 (^)fc) dk \a\d x a 
= 1 f(x-*(l\)z)dx, 

JZ(A)N(A)\G(A) V V 7 7 

the non-trivial unipotent orbital integral of /. Thus the constant term of this Jj(f) 
is indeed analogous to the constant term of the hyperbolic jj(f): one involves a 
regularized (unipotent) orbital integral, and the other a weighted (hyperbolic) 
orbital integral. 

(2) Because the left hand side of (**) is independent of T, it must be that the linear 
terms in T appearing in both the hyperbolic and unipotent Jj{f) cancel out with 
the linear terms appearing in Indeed one can check (using Iwasawa's 

decomposition) that the linear terms in Propositions 1.1 and 1.2 (for all hyperbolic 
o) combine into the single term 

L{T) = {T l -T 2 )m{F*\A 1 ) ^ 11 f {k' 1 ^nh) dn dh; 

y£Z(F)\M(F) J N(A) 



32 



STEVE GELBART 



then this term is seen (later) to exactly cancel the linear term in T arising from 
E x J * (/); see § 2 below > also P- 239 of [ GJ ])- 

(3) The proof of Proposition 1.2 again results from formal computations — see p. 236 
of [GJ] — all of which are justified now by the a priori absolute convergence of 
/ kj(x, f) dx. As in the hyperbolic case, the appearance of the linear term, and the 
"shearing off" (i.e., regularization) of the relevant orbital integral, are explained by 
the fact that the kernel K (x, x) was modified before being integrated. 

(4) The need to modify K a (in this unipotent case) is clarified by the exercise below. 
Exercise. Show that for the unipotent class o, 

Jj(/) = / K (x,x)dx 

JZ(A)G(F )\G(A) 

= val {((F,s)} +m(Z(A)G(F)\G(A))f(l), 

s=l 

with ((F,s) as in the proposition above. In particular, J (f) = oo when the 
"principal part" of ((F,s), namely m(F*\A 1 )F (0) / (s — 1), is non-zero. 

Solution. Recall that 

o = {l}Uo', 

where o' consists of the elements 

{5-'(H)6:teF*, SeB(F)\G(F)} 

Thus we compute 

k ,(x,x)dx= fj2 Yl (^^(Jj)^)^ 

J t^O S£B(F)\G(F) 

t _, Jz(A)B(F)\G(A) V V / / 

which, by Iwasawa's decomposition, equals 

/ / E/v^Oi 1 ')*) H~ 1 d x adk= [ (j2F(at))\a\d*a 
JkJf*\k*^ x V Vo i y / j pxXAx y^ j 

= [ F{a)\a\d x a 

with F(y) as defined in Proposition 1.2. 

Concluding Remark. As we shall see in the next lecture, the explicit expressions 
just obtained for Jj(f) (hyperbolic or unipotent) already suffice to express these 
distributions as (a sum of) products of local distributions J v (fv)', this fact will turn 
out to be crucial in yielding a simple form of the trace formula in Lecture V. 



Z(A)G(F)\G(A) 
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2. Explicit Results for GL(2) (Spectral Side). As already observed in gen- 
eral in Lecture III, the "elliptic" spectral terms Jj(/), where x = {(G,ir)}, are 
independent of T; they just reduce to the distributional characters 

Jj(/)=tr7r(/). 

As for the remaining "hyperbolic" and "unipotent" x, we have the following: 

Proposition 2.1. Suppose \ = is "hyperbolic" in the sense that \i ^ 

p,^ 1 {i.e., there is no 1- dimensional residual spectrum contributing to K x (x,y)). 
Then for T sufficiently large, 

/oo 
tr(p(/i, «)(/)) dt 
-OO 

poo 

tr{M{-it)M'{it)p(p,it)(f))dt 



with the intertwining operators M{s) {and their derivatives) to be explained below. 

Because the proof of this result directly generalizes to any G, we include it. 
By definition, for \ regular {p ^ p^ 1 ), 

(2.2)jJ(/)=/ (V/ E(x, p(p, it)(f)<j>, it)A T E{x, <f>, it)) dt dx. 

JZ(K)G(F )\G(A) V J-oo ' 

However, because we have already established the absolute convergence of this 
integral, we can interchange the orders of integration and write 

/oo 
{A T E{p{p, it)(f)cj>, it), A T E{cf>, it)) dt. 
-oo 

Here we have used also the well-known fact that A T is an orthogonal projection 
operator on L 2 , and hence {(f>i, A T </> 2 ) = {A T (f>i, A T 2 ), with (-,■) denoting the 
usual inner product in L 2 . 

To continue with the proof, we obviously need a more explicit expression for 
the inner product of these truncated Eisenstein series. This is provided by the 
"Maass-Selberg relations" (generalized by Langlands to an arbitrary G in [Lai]); 
in the present context they imply 

(2.4) 

{A T E{(j)' , it), A T E{4>, it)) 

= (Ti - T 2 )(<f>', <f>) + {M{-it)M'(it)4>', 4>) 

+ i-{(0',M(it)0)e 2<t ( Tl - r! ») - (M(zt)0',0)e- 2 ^ Tl - T2 )}. 
Here M(s) = M{s,p) is the operator 

(f> — ► M{s)4>{g) = / 4>{wng) dn 
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intertwining p(p,s) with p(p w ,—s). (Like E(g,<p, p, s), M(s) is initially defined 
only for Re(s) > 1/2, but is analytically continuable in C, and holomorphic on 
iR.) Note M(—s) — M(—s,p w ) maps p(p w ,—s) to p(p,s), and the inner product 
(0i, fa ) is defined by 

(0i,02)= / / 0, (mk)(f>2(mk) dmdk. 

JZ(A)M(F)\M 1 (A) 

Finally, M'(s) is the derivative j^M(s, p) : p(p, s) — ► p(p w ,—s) (defined by iden- 
tifying p(p, s) with p(p, 0) ...). 

Now formula (2.4) is valid not just in our case (when p is "regular"), but also 
when p = (U -1 ; in fact we shall require (2.4) below when we treat the "unipotent" 
contribution Jj(/). However, when p ^ p~ x , it is a simple matter to observe that 
the third (and most complicated) term on the right side of (2.4) actually vanishes. 
Indeed, as M(s) = M{s,p) maps into p{— s,^ -1 ), it follows that 0' and M(it)(f> 
transform under M 1 (A) according to distinct characters (namely, p and p -1 )- Thus 
the inner product 

{tf>',M(it)4>)= j ^\k)M{it)<j){h)dh{ [ p 2 (m)dm) 

J J Z(A)M(F)\M 1 (A) 

vanishes (and so, similarly, does (M(it)(j>', 0)). So plugging (2.4) into (2.3) simply 
yields 

/oo /*oo 
tr(p(/Mt)(/))dt+ / tv(M(-it)M'(it)p(p,it)(f))dt, 
-00 </ —00 

as required. 

We now move on to the more problematic case when p = In this case, 

plugging in (2.4) into (2.3) yields (in addition to the terms in (2.5)) the term 



(2.6) E(T) = r {(p{n,it)(f)<t>,M(it)<t>) 
, J-00 I 



e 2it(Ti-T 2 ) 

-2*t(Ti-T 2 ) 



- (p(p,it)(f)<j>,M(-it)<j>)- — 1 dt. 

(Here we have used the fact that the adjoint of M(s) is M(— s).) But this still 
isn't enough to describe J%(f)- In this singular situation, what's missing is the 
contribution from the one-dimensional residual spectrum, i.e., the contribution 

J»' TeS (f) = M/) / p(x)A T (pJx)dx 

JZ(A)G(F )\G(A) 

from the residual kernel 

rT. 



K 1 / cs {x,y)^p{f)p{x)p{y). 
Analyzing both these functions of T (namely E(T) and </J' rcs (/)) yields finally: 
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Proposition 2.7. For a(T) > 0, and \ = {(M, /x)} with /x = ijT 1 , 

/CO /* oo 

tr(p(n,it)(f))dt+ / tv{M{-it)M'Ht)p{^it){f))dt, 
-co J —oo 

+ \ tr(M(0)p( M , 0)(/)) + ,i(/)m(Z(A)G(F)\G(A)) 
pZws n\(T), a negligible term (forT very large). 

Proof. In [GJ, pp. 239-240], a Fourier analysis argument is given to show that 

£(T) = itr(M(0)p(M,0)(/))+n(T) 

with ri(T) negligible for a(T) large. On the other hand, it is a pleasant exercise to 
check that 

^J' rcs (.f) — Kf)m(Z(A)G(F)\G(A)) as T - oo. 

Indeed, 



JZ(A)G(F)\G(A) 



= /*(/) / fe) - E PO*)™ - T)) da: 

= / - M/) / E fs - T ) dx 

<5eB(.F)\G(F) 

= /i(/)m(Z(A)G(F)\G(A)) - /*(/) / f B (# (a;) - T) dx 

J Z(A)B(F)\G(A) 



with 



= p(f)m(Z(A)G(F)\G(A))-n 1 (T) 

-2(T!-T 2 ) 

m(T) = /j(/)m(F*\A 1 ) >oo asa(T)^oo. 



So setting ^i(T) = ri(T) + n\{T) completes the proof. □ 

Concluding Remark, (a) Proposition 2.1 expresses directly as a poly- 

nomial in T when \ is regular. But as already suggested, Arthur has proven in 
general that J^(f) is polynomial in T . Thus it must follow that the "negligible" 
term n\(T) (appearing in the expression for </J(/) f° r singular \ m Proposition 2.7) 
is actually zero identically. (Indeed, it is a polynomial in T, which goes to as T 
goes to infinity . . . .) On the other hand, the expression E(T) (or equivalcntly 
(6.28) in [GJ, p. 239] is definitely not a polynomial; rather E(T) includes a negli- 
gible term r\ (T) which must cancel the negligible (exponentially decreasing) term 
ni(T) appearing in the expression for Jj' res (/)- 

(b) Our computation of 

^ T ' ros (/) = M(/) / n(x)k T QUx))dx 

J Z(A)G(F)\G(A) 
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really amounts to the computation of the inner product of two truncated "degen- 
erate Eisenstein series" (induced from fj, on P = G . . . ) . In general, the explicit 
description of Jj' res (/) with \ "singular" will involve an inner product 

(2.8) (A T E P (<j>',\'),A T E P (cj>,\)) 

where <j) in Indp|*j a belongs to L 2 {P) X , but P does not belong to P x . Thus the 
natural generalization of Proposition 2.7 must involve Arthur's generalization of 
Langlands' formula (valid for P ^ P x , but like the formula for Jj' res (/) giving only 
an asymptotic expression for (2.8); see [A9] ). 



3. Results for General G. Arthur's generalization of the above results, and in 
fact the GL(2) results themselves, seem much easier to digest and understand when 
viewed not as a long chain of unrelated results, but rather as realizations of a few 
fundamental phenomena and principles. Of course Arthur knowingly developed his 
trace formula with these thoughts in mind. To begin to explain them, let us start 
with the symmetry of the formula itself. 

Remark 3.1. Consider the formula 

E J o(f) = E ■£(/)■ 

Although at first sight the indexing sets D and X seem asymmetric and unrelated, 
they can actually be viewed as mirror images of one another. Indeed, if oeO, 
consider those (standard) parabolics B of G which are minimal with respect to the 
property that o meets Mb- Then o n Mb is a finite union of Mb{F) conjugacy 
classes which are "elliptic" in the sense that they meet no proper parabolic subgroup 
of Mb- Moreover, if Wo denotes the restricted Weyl group of (G, Ao), then D is 
in bijective correspondence with the set of Wo-orbits of pairs (Mb,Cb), where 
B is a parabolic subgroup of G, and cb is an elliptic conjugacy class in Mb(F). 
Thus D indeed corresponds naturally to X, itself defined to be the set of Wo-orbits 
of pairs (Mb,tb), where tb is an irreducible cuspidal automorphic representation 
of Ms (A). This description of D has the additional benefit that it highlights the 
analogy between "elliptic" classes and "cuspidal" representations. (For convenience 
here, we have ignored centers . . . .) 

In light of the symmetry between D and X (expressed by this Remark 3.1), it is 
not surprising that most of the general results Arthur proves for the distributions 
Jj(f) have an immediate analogue for Jj(/). For example, if o = {(Mb,cb)} is 
such that Jj(f) is a polynomial of degree at most dim(As/Z), then so is Jj(/), 
for x = {{Mb^b)}- Or, if o = {(Mb,cb)} is suitably unramified that Jj{f) 
is expressible as a weighted orbital integral, then the corresponding J x (f) should 
be expressible as a weighted average of characters. Let us explain these matters in 
some more detail now, and along the way bring into play some other basic principles 
as well. 



LECTURE IV. MORE EXPLICIT FORMS OF THE TRACE FORMULA 



37 



Proposition 3.1 (see Proposition 2.3 of [A4]). For each fixed f, Jj{f) is a 
polynomial in T of degree at most dim(ap /aG) {with Pq the minimal parabolic). 

The proof has two main ingredients. First, for any parabolic P D Pq, and point 
X in ao, a certain characteristic function 

T' P {H,X)= J2 {-l) dim(Aa/Z) T^{H)T R {H - X) 

{R: RDP} 

is defined on do, with Tp the characteristic function on do of the set 

{ff£o : a{H) > 0, a G A^} 

and Tp of the set 

{Hea : u{H) >0,iie Ap}. 

(Here Ap is the dual basis in corresponding to the basis of coroots {a v : a G Ap} 
in Or.) 

For example, for GL(2), and X = (X ,-X ) in etf, T' B {H,X) = t b {H)~t b {H- 
X); as a function on the one-dimensional subspace aff = {(T, — T)} w {T}, this is 
just the characteristic function of the interval [0, Xq\. 

On the other hand, for G = GL(3) and {X 1 ,X 2 , ~{X 1 + X 2 )) = X in a+, the 
function T" p { • ,X) (viewed on the two-dimensional space {{r%, r 2l — (r\ + ^2))}) is 
just the characteristic function of the shaded region below: 




Figure 1 

In general, the crucial property of T' p {H, X) is that it has compact support, and 
that its integral 

/ T' P {H,X)dH 

is a polynomial in X of degree q = dim{Ap/Z). (Here is the subspace of dp 
annihilated by cig, so in particular, ap = dp © ac-) 

Second, certain geometric distributions J* /<3,T (analogous to J J) are defined for 
each Levi subgroup M of G. Namely, for any fixed suitably regular point T\ in 
a+, class in D and f Q in C C 00 (Z(A)\M Q (A)), define 



m q ,t x 
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where o n Mq(F) is the union (possibly empty) of "classes" Oi, . . . o„ in Mq. (If 
o n Mq (F) = 0, then J Mq,Ti = 0; on the other hand, if M = G, then J M ' Tl = jj 1 .) 
To prove Proposition 3.1, Arthur derives the formula 

(3-2) Jj(.f) = J ° MQ ' Tl / T Q^ T T i) 

QDPo Q 

where /q in C^°(Z(A)\Mq(A)) is explicitly determined by / (and T varies freely 
through Ti + oj). Since each JT' Q (H,T - T x ) dH is a polynomial in T - Ti of 
degree af mosi dim(^4p /Z), it follows that the same is true of Jj(f), as claimed. 
Note that Jj(f) will be independent of T, i.e., be of degree zero, precisely when o 
is elliptic, i.e., has empty intersection with every proper parabolic Q appearing in 
(3.2) above. 

Remark 3.3. As expected, there is an analogue of Proposition 3.1 for Jj(/), 
proved in just the same way (in keeping with the principle that the geometric and 
spectral terms are different facets of the same type of object). For \ m an d M a 
Levi subgroup of G, one similarly defines distributions J^f' T on M(A) and proves 
(analogously) that 

(3.4) 4(f) = Jx Q ' Tl I V Q {H,T-T x )dH 

QDPo Ja Q 

is (again) a polynomial in T of degree < dim(A /Z). Interestingly, one also obtains 
the equality 

(3-5) £ Jf < T (/) = J2 J x' T Ul 

which may be viewed as (a first form of) Arthur's trace formula "relativized" to an 
arbitrary Levi subgroup Mp of G\ This relativization is reminiscent of Langlands' 
use of the spectral decomposition of L 2 (P) (P any parabolic of G) in order to 
describe L 2 (G) itself, and it constitutes another of the unifying themes in Arthur's 
work. 

To turn next to explicit formulas for Jj(f) (and Jj(/)), we need first to recall 
Arthur's notion of an unramified class of datum. For \ = {(^B: r s)} this just 
means that for any pair (Mb, Tb) in x, the only element of 0(ap, Op) fixing tb is 
the trivial element. On the other hand, for o = {(Mb,cb)}, the notion is a little 
less obvious. 

Namely, fix a class o, and choose a parabolic subgroup P and a semisimple 
element 7 in such that 7 belongs to M(F) but not to the Levi subgroup of 
parabolic properly contained in P. Then let XXt) denote the (possibly empty) set 
of roots a in = J2(P>Ap) such that the centralizer of 7 in t) a (the root subspace 
of the Lie algebra of Np belonging to a) is not zero, and let A' be the intersection 
of the kernels of these roots a (regarded as characters of A). Now assume that 
every element in is semi-simple, and choose a parabolic subgroup Pi, and an 
element w G 0(o, Oi) such that Ap 1 = wA'w^ 1 . Setting 71 equal to w-fw^ 1 , we call 
unramified if the centralizer G(7i) is contained in M\. 
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Note that 

,-(; ;) or T _(S j o 

determines a semi-simple class in GL(2) (or GL(3)) which is unramified, with P 
the Borel subgroup, A' = Ap, Pi = P, and 71 = 7. On the other hand, any class 
in GL(3) containing a non-semisimple element, like say 

'a 0' 
7 = I 1 x 

i r 

is automatically ramified, and its corresponding distribution Jj(f) will not be of 
the nice type described in the proposition below. 

Proposition 3.6. Suppose is an unramified class in D, and 71 is a (semi- 
simple) element in 0. Then for sufficiently large T , 

jj(f) = m(G 7l (F)\G^ (A)) / f{x-^ix)v{x, T) dx, 



with v(x, T) (the weight function) equal to the volume of the convex hull of the 
points 

{w' x T -w^Hp^wx): w G (JO(ai,a 2 )} 

P2 

projected onto Oi/ac- 

Let us describe v(x,T) graphically for G = GL(2) or GL(3), with 71 a diagonal 
element (with distinct eigenvalues). We shall also sketch the proof of the proposition 
for GL(2), making clear how the volume v(x, T) naturally arises. 
N.B. We should really write v (x, T) for v(x, T), since this function indeed depends 
on the nature of 0. 

For GL(2) or GL(3), and 71 as just specified, v(x,T) is precisely the volume of 
the convex hull of the projection of 

{u; _1 T - w~ 1 H Po (wx): w e 0(a Po ,ap )} 

onto ap /ac. For GL(2), this is just the length of the line segment determined 
by the points (Ti,T 2 ) — H(x) and (T 2 ,Ti) — wH(wx) in ap„ projected onto the 
one-dimensional space Op /aG, i.e., 

v(x, T) = 2(Ti - T 2 ) - (a(H(x)) + a(H(wx))). 

Thus Proposition 3.5 indeed reduces to Proposition 1.1. 

On the other hand, for GL(3), v(x,T) (at least for T = 0) reduces to the area 
of the convex set 

Hpjx) 



Hp(x) 
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Figure 2 



whose extreme points are indexed by the Weyl group elements {wi = e, W2, ■ ■ ■ , wq}. 
Note here that 



where P w denotes the (not-necessarily standard) parabolic subgroup w~ 1 P w, and 
Hp w (x) is defined analogously to Hp a using the decomposition G = P W K. (Indeed 
vox = nmok implies w _1 Hp (wx) = Hp (mo), and x = w~ 1 nmok implies 
Hp™i x ) — Hp (w~ 1 mw) = w~ 1 Hp (m ), i.e., Hp m {x) = w~ 1 Hp (wx).) Thus we 
could have alternately described v(x) — v(x, 0) in terms of the points 



where V(Mq) denotes the set of parabolics (not necessarily standard) in G whose 
Levi part equals Mq. 

Remark 3.7. Weighted orbital integrals of the above type were first systemat- 
ically studied by Arthur in the context of real groups. In [A8], Arthur studied 
general properties of these real weighted orbital integrals, motivated by a sugges- 
tion of Langlands that such integrals would arise in any projected general treatment 
of the trace formula. In particular, for matrix coefficients of discrete series repre- 
sentations, Arthur related J f(x~ 1 jx)v(x) dx to the characters of the discrete series 
(whence the title of [A8]). On the other hand, Arthur's weight functions v(x) were 
also interpreted in [A8] in terms of what would eventually be known as (G, M) 
families. We shall return to this point in a later lecture, where (G, M) families 
are discussed in earnest. 

On the Proof of Proposition 3.6 for GL(2). Precisely because is unramificd 
(in this case, a hyperbolic class with semi-simple element 71), the centralizer G(7i) 
is contained in M (in this case equals M) and we can express the modified kernel 
as 



w 1 Hp (wx) — Hp w (x), 



{H P (x): PeV(M )}, 



kJ(x,T) 



8eM(F)\G(F) 
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Thus 

= / kJ{x,T)dx 

JZ(A)G(F)\G(A) 

J Z(A)B(F)\G(A) [ neN(F) 

- / /(x^n^jmx) dn t b (H(x) - T) \ dx 

JN(A) J 

(here we made the change of variables n — > (7^ n _1 7i)n in the second integral) 
= / f{x- 1 11 x){l-T B {H{x)-T))dx, 

J Z(A)M(F)G(A) 

(here we used the decomposition 

Z(A)B(F)\G(A) w N(A)Z(A)M(F)\G(A) ■ N(F)\N(A), 

carrying out first the integration over N(F)\N(A)). But now observe the following: 
because we know kJ(x,T) is absolutely integrable, we also know that the last 
integral 

A = [ f{x- 1 j 1 x)(l-T B (H(x)-T))dx 

J Z(A)M(F)\G(A) 

converges absolutely, and equals (from the change of variables x — > wx) 
B = j f{x- 1 ll x){l-T B {H{wx)-T))dx, 



i.e., 



= a = Ua + b) 



= \\ f{x- 1 llX )(l-T B (H{x)-T)-T B {H{wx)-T))dx 

J Z(A)M(F)\G(A) 

= I /(x-^xXix^dx, 

JM(A)\G(A) 



with 



v*{x,T)= [ {l-T B {H(mx)-T))-T B {H(wmx)-T))dm, 

Jz(A)M(F)\G(A) 



IZ(A)M(F)\G(A) 

and it remains only to prove: 
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Claim. Setv^(x) = l-T B (H(x)-T)-T B (H(wx)-T). Then 

v*(x,T)= / v^p(mx)dm= Arthur's weight factor v(x,T). 

J Z(A)M(F)\M(A) 

Proof. Note that v^(x) is identically zero unless 

(a) Both tb(H(x) — T) and tb(H(wx) — T) are 1 (in which case v^(x) = — 1); 
or 

(b) Both arc zero (in which case v^(x) = 1). But the first possibility for 
v^(mx) 7^ implies that 

a(H(mx)) > a(T) and a(H(wmx)) > a(T), 

which (for a(T) sufficiently large ... ) implies (by Lemma 4.1 of Lecture 
II) that w G B(F), an obvious contradiction. On the other hand, it is 
straightforward to check that the second possibility for v^(mx) ^ implies 

a(H(mx)) - (Ti - T 2 ) < a{H{m)) < (7\ - T 2 ) - a(H(x)). 

Thus the decomposition 

Z(A)M(F)\M(A) = U a Q °A :a€F'\A 1 \.\( e * J 



/v^(mx)dx= / d x a I 
JF^XA 1 Ja 



(Ti-T 2 )-a(.ff(x)) 

dt 

F^XA 1 Ja(H(wx))-(T 1 -T 2 ) 

m(F x \A 1 )v(x,T). 



□ 



Remark 3.8. The analogue of Proposition 3.6 for a spectral unramified dis- 
tribution J%(f) is also a "straightforward" generalization of the GL(2) situation 
(Proposition 2.1), namely, we have: 

Proposition 3.9. For \ — {(M,r)} unramified, 

J x(/)= E ZTTT / tr(MT(a) x p(a,\)(f))d\, 

p e -p x n \ A P) Jia* p /ia* G 

with Mp(a) x the operator on Indp(cr, A) defined by 

c P hm > y, — 

P2ev xW en( aPl ,ap 2 ) II ( w 0(a V ) 

ueAp 2 

and M(w,C 1 a) the operator intertwining Indp x a\ | A+< » with Indp 2 o w \ \ w ( x +0 . 

As suggested in §2, the crucial point of the proof is Langlands' formula for the 
inner product of truncated Eisenstein series, and since this is valid for general 
G (for Eisenstein series Ep with P e V x ), the proof of Proposition 2.1 generalizes 
directly. The interesting point here is that the integrals appearing in the description 
of J x {f) above really should be viewed as weighted character sums. In fact, using 
the notion of (G, M) families, Arthur eventually views both Mj(cr) and the weight 
functions v(x) as special examples of functions constructed out of similar (G,M) 
families; we shall return to this point in Lecture VII. 
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Lecture V. Simple Forms of the Trace Formula 



In this lecture and the next, we explain (at least for GL(2)) how the trace formula 
can be molded into a simpler form, and then used to obtain dramatic results in 
representation theory, number theory, and the theory of automorphic forms. In 
addition to providing a welcome respite from the general theory, this detour will 
provide impetus and direction for the further discussion of Arthur's development 
of a general trace formula. 

Roughly speaking, the trace formula takes on a simpler form when more re- 
strictive hypotheses are put on the test functions /. The more restrictive the 
hypotheses, the easier it is to establish the corresponding "simple trace formula" ; 
but the less restrictive the hypotheses, the more effective it is to apply the resulting 
trace formula to the theory of automorphic forms. 

For example, the "very simple" trace formula of Deligne-Kazhdan (described 
in §3 below) has nice applications to local representation theory, but cannot give 
complete results on the functorial lifting of automorphic forms, and cannot give 
applications to the computation of Tamagawa numbers. For such applications one 
seems to need the "simple trace formula of Arthur" , described in §2 below (in the 
context of GL(2)). 

We start by showing (in §1) that each term on the right side of the formula 
tr(i? (/))=£j T (/)- J2 J x(/) 

may be expressed as a finite sum of products of local distributions on each G v . 
Then we show (in §2) that for specially chosen / = sufficiently many of 

these local distributions vanish to ensure that the resulting trace formula reduces 
to the simple formula 

tvR d (f)=m(Z(A)G(F)\G(A))f(l) + ]T J (f) 



elliptic 

(as in the case of compact quotient). 
Henceforth, we assume 

f = Hfv 

V 

is such that for all v, f v <G C£°(Z V \G V ), and for almost all v, f v is the characteristic 
function of Z V K V . 
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1. Factorization into Local Distributions. We start with the geometric terms 
J (f)- Fix / = Hf v , and let Sf denote the finite set of places of F outside of which 
/„ is the characteristic function of Z V K V . 

Proposition 1.1. Suppose 7 is hyperbolic and belongs to 0. Then J (f) (the 
constant term of the polynomial Jj(f)) is expressible as the sum (over v in Sf) of 
the products 

C (D / fw(g~ 1 ig)dg) / fv(g^ 1 ig)v v (g)dg. 

y ^ v JM w \G w ' Jm v \g v 

Here v v (g) = a(H v (g) + H v (wg)) is the local weight function. 
Proof. It is easy to check that for g in G(A), 

v (d) = ^2 v v(9v), 

V 

with v v (g v ) = on Z V K V . Thus, Proposition 1.1 of Lecture IV immediately implies 

Jo(f) = c V / f(g~ 1 ig)v v (g)dg 

- iM(A)\G(A) 

= 5Z C (TI / fw(g~ 1 ig)dg) I f v (g~ 1 ~fg)v v (g)dg. 

v y ^ v JM w \G w > Jm„\g„ 

But if v $l Sf, the second integrand is identically zero. Thus the proposition is 
clear. □ 

The factorization of the unipotent term is a bit more complicated. Recall from 
Proposition 1.2 of Lecture IV that the constant term of the (non-trivial part of the) 
unipotent term Jj(f ) is computed by subtracting off from 

C(F, s) - J f(k- 1 (J")*) dk\a\ s d*a 

its principal part at s — 1, and then setting s = 1. 

So first rewrite this last integral as L(s, 1f)6(s), where 

fl(s) = ^M// /(rl (")' :)&HSfa - 

By Tate's theory, 0(s) is holomorphic at s = 1, whereas 

L(s,l F ) = Ai + A + ... . 

s — 1 

Therefore, the constant term J (f) we need to compute is A_i#'(l) + A #(l), and 
arguing as in [GJ, pp. 242-243], we get: 
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d_ 

ds 



Proposition 1.2. With Sf as before, and o unipotent, 

^///-(^(ii)*)* \< fa 

+ m(Z(A)G(F)\G(A)f(l). 

Remarks. (1) Each of the local unipotent orbital integrals 

/ fv(9- 1 ( 1 Q 1 1 )g)dg = t(F v ,l) 

converges, and for almost every v equals (1 — iV^ 1 ) -1 . It is the appearance of 
the "convergence factors" L(l, l^) -1 which makes possible the convergence of the 
product of these local integrals in Proposition 1.2. (The divergence of the global 
unipotent orbital integral 

v J 

was of course the reason for introducing the modified Jj '(/) in the first place . . . ). 
(2) If u $ S f , then 

Z//^" 1 (or) *) *l<^ - Z(^) ' C(1 °- S) = 1 

and its derivative is zero; that's why we need only sum over u £ Sf in Proposi- 
tion 1.2. 

We turn now to the spectral contributions. Here we need to recall some crucial 
facts about normalized intertwining operators. 
Recall the operator 

{M{s)<t>){g)= f ^(wng)dn (Re(s) > \) 
Jn(a) 

intertwining p(p,, s)with p(fj, w ,— s). This is the operator — or rather its analytic 
continuation to iR — which appears in our description of the spectral "constant 
term" J x (f){x = (-^A 4 ))- The problem is that in analytically continuing M(s) we 
lose its Euler product factorization (just as we lose the Euler product for £(s) in 
analytically continuing it to the left of Re(s) = 1). It is to restore this factorization 
that one needs the normalized intertwining operators R{s) recalled below. 
For a fixed additive character ip — J\ v tp v of F\A, and v any place of F, set 

m v {s, (j, v ,ip v ) - 



L(s + l,(j%)s(s,i4,ipv) 
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Then define local normalized intertwining operators 

R(s, p v ) : p(s, p v ) — > p(-s, p%) 

by the equality 

M(s, p v ) = m(s, p v ,ip v )R(s, p v ) 

(where M(s, p v )(f> is just defined by the local integral J N <j>(wng) dn, convergent 
for Re(s) 2S> 0). The crucial property of this normalized operator R(s,p v ) is that 
it takes the (normalized) if„-fixed vector </>° in p(s,p v ) (for p v unramified) to the 
unique normalized K v -Rxed vector in p{— s, p" 1 ); moreover, as an operator valued 
function of s, R(s,p v ) is holomorphic in Re(s) > —\. Thus we can define, for 
= H<j> v in p(s,p), 

R(s, p)(j) = Y[ R(s v ,p v )(j> v . 

V 

Since R(s, p v ){<t>v) = 4>v almost everywhere, this product makes perfect sense; more- 
over, R(s,p) is holomorphic in Rc(s) > — |. But clearly (at least for Re(s) » 0), 

M(s, p) = m(s, p)R(s, p), 

where 

m ( s ,^) = Y[™> v (s,p v ,ip)v). 

V 

So since m(s, p) is meromorphic, we indeed obtain the analytic continuation of 
M(s,p) to zK with its "Euler product" factorization intact. 
N.B. By the (global) functional equation of Hecke L-functions, 

L{s,p 2 ) L(l-s,p- 2 ) 
{ ,N L(s + l,M 2 )e(s,M 2 ,^) L(s + l,p 2 ) 

Proposition 1.3. For \ "unramified", the (constant term of the) spectral contri- 
bution J x (f) equals 

m'tit, p) . , , 
^^trWit,p)(f))dt 

/OO 
(II tr(p(it, Pv){fv)) tr(R u (it, flu)' 1 R* u (it, p u )p(p, «*)(/«)) dt. 

In case \ * s "ramified" (p 2 = 1), there are the additional terms 

(1.4) tr(M(0)p(/i,0)(/)) 

and 

p(f)m(Z(A)G(F)\G(A)). 
Proof. Straightforward (sec [GJ, p. 243]). □ 
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2. Simplifications of the Formula. A first kind of simplification comes form 
making the relatively mild assumption on 

that its local hyperbolic orbital integrals 

(*) / /(^(/o i)g)<rg = o 

Jm v \g v v v ' ' 
for at least two place v = v\ , v 2 ■ 

Proposition 2.1. With f satisfying the above assumption (*), we have 
tvRo(f) = m(Z(A)\G(A))f(l) + ]T J (/) - ]T p(f)m(Z(A)G(F)\G(A)) 

o elliptic li 1 — ! 

Proof. First consider the spectral contributions J x {f), with \ = {(W,/*)}- If 
v\ is a place where (*) holds, then tr p(it, p Vl ){fv) — (by a well known com- 
putation of the trace of induced representations). Thus the first term appear- 
ing in Proposition 1.3 immediately disappears. But if (*) also holds at a second 
place i>2, then it is clear that the second term there must also always vanish. On 
the other hand, if p? = 1, then M(0,p) intertwines the irreducible representa- 
tion p(0, p) with itself; hence M(0, p) must be a scalar operator. In particular, 
tr(M(0)p(0, p){f)) = Atr(p(0, p){f)) = by our assumption on /, and we conclude 
from Proposition 1.3 that the full spectral contribution = (M, p)J x {f) reduces 
to(E^ =1 M/))m((^(A)G(F)\G(A)). 

Now we consider the possible geometric contributions J (f), following Propo- 
sitions 1.1 and 1.2. For the hyperbolic contributions, it is again obvious that if 
assumption (*) holds at two places, then each J {f) is zero (just as for the unram- 
ified spectral forms). On the other hand, for the unipotent contribution, it suffices 
to make the following observation: 

For any local /„ in C^(Z V \G V ), 

l wa , '-i'- 1 (-)')*- W 1 - »"'/„ AO , ^ (:!)») *■ 

This is a simple yet important identity, which we encourage the reader to verify 
(using Iwasawa's decomposition and an appropriate change of variables in N). It 
implies that the local unipotent orbital integral of /„ vanishes as soon as its hyper- 
bolic orbital integrals vanish. In particular, as soon as (*) holds for two places, it 
follows that all of the unipotent term J (f ) vanishes, except for the term involving 

/(!)• □ 

In the next lecture, we shall explain how this simplification of the trace for- 
mula is used to prove the Jacquet-Langlands correspondence between automorphic 
representations between a quaternion algebra and GL(2), and to show that the 
corresponding Tamagawa numbers are equal. 



LECTURE V. SIMPLE FORMS OF THE TRACE FORMULA 



49 



3. The "Simple" Trace Formula. In the early 1980's, Delignc and Kazhdan 
introduced a remarkably simple trick into the trace formula repertoire, the so-called 
"simple trace formula" (see [BDKV]). As we shall see in the next lecture, it yields 
powerful local results on functorial lifting, with surprisingly little work. 

Theorem 3.1 (The Simple Trace Formula of Deligne-Kazhdan) . Suppose f = 
Yiv fv * n C%°(Z(A)\G(A)) satisfies the following two properties: 

(i) At one place v = v\, f Vl is the matrix coefficient of a supercuspidal repre- 
sentation of G Vl ; and 

(ii) at a second place v = v 2 , f V2 is supported on the set of regular elliptic 
elements of G V2 . 

Then 

(a) R(f) has its image in Lq(Z(F)G(F)\G(A)), hence is of trace class, with 

tvR(f)= tr7r (/); and 

ir cuspidal 

(b) we also have 

tri?(/) = V m(Z(A)G 7 (F)\G 7 (A) / f{x~ x -fx)da: 

regular 
elliptic 



Remarks. (1) The condition we really need in (i) is that f Vl is a supercusp form 

in the sense of Harish-Chandra, i.e., (like the matrix coefficient of any supercuspidal 
representation) , 

fvi (gnh) dn = for all g, h, in G v . 



(2) Both the theorem, and its proof below, hold more generally for any G; see [Rol], 
§1, for example. 

(3) This simple trace formula is indeed less subtle than the simple form of Arthur's 
trace formula discussed in the last section. In particular, note that the distribution 
/ — > m(Z(F)G(F)\G(A))f(l) disappears entirely from this simpler form of the 
trace formula (and hence, for example, no application to Tamagawa numbers is 
possible ... ). It is just this crudeness, however, that makes the formula much 
easier to prove! 

Proof. For (a), it suffices to show that R(f) has image in Ll(Z(F)G(F)\G(A) for 
then the theorem of Gelfand and Piatetski-Shapiro (see [GPS] and [Go2]) implies 
R(f) is of trace class). To check that R(f)<f> is a cusp form for any <f> we simply 
compute: 

/ R(f)(j)(nx) dx = / (/ V f(x~ 1 nvg)^)(g)dg) dn 

Jn(f)\n(A) Jn(f)\n(a) Vz(A)\G(A) veN(F) J 



I (I f{x- 1 ng)dn)cj>{g)dg = Q 

J Z{k)N(F)\G{k) K JN(A) 
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Since 




by our assumption (i) on f Vl . 

To prove (b), note that a 7 in G(F) is regular elliptic as soon as it is regular 
elliptic in G v = G(F V ) for some place v. But f(x~ 1 jx) 7^ implies {supp(/)} n 
{Orbit of 7} 7^ 0, which in turn implies {supp(/ t)2 )} n {Orbit of 7} 7^ 0. So by 
hypothesis (ii), 7 is regular elliptic in G V2 , i.e., 



Concluding Remark. Implicit in the proof above is the fact that Kf(x,x) is an 
absolutely integrable smooth function on Z(A)G(F)\G(A), and hence tr(i?(/)) 
(= tr(i?o(/))) is given by its integral. In fact (3.2) implies Kf(x,x) is compactly 
supported on Z(A)G(F)\G(A) (as we saw in Lecture II). Interestingly enough, if 
we drop the hypothesis (ii), so that (3.2) need no longer hold, it still follows (from 
(i) above) that Kf(x,x) is absolutely integrable (in fact, rapidly decreasing), and 
that tr R(f) = J Kf(x,x) dx; for a proof, see [Rol, §1]. 
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Lecture VI. Applications of the Trace Formula 



Let G = GL(2), and let G be the multiplicative group of a division quaternion 
algebra D over F. 

In this lecture, we shall explain how the simple forms of the trace formula are 
used to prove the following three results: 

(1) The Tamagawa number of G equals the Tamagawa number of G; 

(2) There is a bijection (the "Jacquet-Langlands correspondence") between the 
automorphic it' on G (which are not one-dimensional) and the automorphic 
cuspidal 7r of G (such that ir v is square integrable for each place v of F 
ramified in D); and 

(3) Given any finite set S of finite places v of F, and square-integrable n v on 
G v for v C S, there exists a cuspidal i on G such that (ir) v = n v for all 
v E S. 



1. Tamagawa Numbers. At the end of the 1950's, Weil conjectured that the 
Tamagawa number of any simply connected semi-simple group G equals one. At 
the end of the 1960's Jacquet-Langlands proposed a two step program for proving 
this: 

(1) Prove first that 

r(G ) = 1 

for Go the quasi-split inner form of 67, using the Eisenstein series method 
introduced by Langlands for split groups (see [La3] and [Lai]); then 

(2) Use the trace formula to prove that 

r(G) = r(Go). 

This second step was first carried out in §16 of [JL] (for GL2 in place of 
SL2), and then recently generalized to arbitrary G by Kottwitz to prove 
Weil's conjecture in general (see [Kot]). 

We should stress that — in deriving a simple trace formula for an arbitrary semi- 
simple quasi-split G — Kottwitz had to appeal not only to all of Arthur's work on 
the trace formula through 1988, but also to his own earlier works on the stable 
form of the trace formula. By specializing Kottwitz 's argument below to the case 
of GL(2) (in place of SL(2)), we manage to avoid both these bodies of work. 

We start by recalling the notion of Tamagawa measures for our G and G . 

Fix a non-trivial character tp = Y\ij) v of F\A. For each place v of F, i[> v de- 
termines a Fourier transform on A v = M 2 {F V ) or D v , hence also a self-dual Haar 
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measure dy on A v . The measure dy then determines a Haar measure cfa//|jy|| on 
A*, where \\y\\ is the module homomorphism from A v to R+. Hence the choice of 
ip simultaneously determines Haar measures on each G v and G' v ; similarly, Haar 
measures are determined on F v = Z v — Z' v . Such measures are called Tamagawa 
measures, locally and globally. The resulting measure of Z(A)G(F)\G(A) is inde- 
pendent of tp and called the Tamagawa number r(G) of G, i.e., 

r(G) = m(Z(A)G(F)\G(A)). 

Similarly, 

r(G') = m(Z'(A)G'(F)\G\A)). 
The center Z' may of course be identified with the center Z ~ F x of GL(2). 
Theorem 1.1. The Tamagawa numbers of G and G' are equal. 
We start the proof with a definition. 

Suppose v in F is ramified in D, i.e., D v = D ® F v is a division quaternion 
algebra over F v . Then each quadratic extension L v of F v can be regarded simulta- 
neously as a Cartan subgroup T' v — T' L of G' v and as an (elliptic) Cartan subgroup 
T v =T Lv of G V = GL 2 (F V ). 

Definition 1.2. We say a function /„ in (Z V \G V ) matches f v in G c °° (Z' V \G' V ) 
(and write /„ <~ f' v ) if: 

(i) / B (l) = /i(l); 

(ii) the regular hyperbolic orbital integrals 

/ fv{g- 1 ( a ° 1 )g)dg 
Jm v \g v v ' 

vanish identically; and 

(iii) for corresponding tori T v and T' v and t ~ i' , 

/ /,(5" 1 ^)rf5= / f' v {g- 1 t'9)d'g. 

N.B. The Haar measures on Gt, and G^ are (simultaneously) normalized by way 
of the Tamagawa measures recalled above; the measures on T v and T' v are fixed by 
a choice of Haar measure on L v . 

Fact 1 (Local Harmonic Analysis on G). Given any f' v in G£° (Z' V \G' V ) , there exist 
(infinitely many . . . ) f v in (Z V \G V ) which match f' v in the above sense. 

This crucial fact can be proved using a characterization of the orbital integrals 
on GL 2 a la "Shalika germs" ; see §6 of [La2] 

Using Fact 1, we can relate Tamagawa numbers to traces, and then prove The- 
orem 1.1. 
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Proposition 1.3. Let Sd denote the set of v in F ramified in D. Given f 
[I f' v in C?(Z'(A)\G'(A)), suppose f = Ufv in C~(Z(A)\G(A)) is such that for 
all v in Sd, fv ~ f' v [in the sense of Definition 1.2), and for all v ^ Sd, fv = f' v 
(via the natural isomorphism of G v and G' v ). (In this case, we say f "globally 
matches" f, and write f ~ /'). Then 

(1.4) tri? (/) - tr R' (f) = {r(g) - r(G')}(/(l) + M/)), 

with R' the representation ofG'(A) in the subspace of L 2 (Z(A)G' (F)\G f (A)) or- 
thogonal to all one- dimensional invariant subspaces. 

Assuming this proposition (for the moment), we give the: 

Proof of Theorem 1.1. If we rewrite (1.4) as 

(1.5) tvR Q (f)-tTR' (f')-{r(G)-T(G')} £ //(/) = {t(G) - r(G')}(/(l), 

fj, 2 =i 

then it clearly suffices to prove that both sides of this equation — valid for all / <~ /' 
as above — equal zero. For this, we follow an argument first explicitly introduced 
by Langlands in [La2]. 

Fix a place vo outside Sd, and consider the Hecke algebra H Vo (G Vo , K Vo ) . If we 
fix all components of 

f = Uti 

V 

except the v -component, which we let vary through H Vo , then (1.5) reads 

(1.6) ^CjtT^j^f^) - ^ C ^v„(fv„) = c {t(G) - r(G')}f Vo (l), 

where the iTj V0 (and /z„ ) are unramificd unitary representations of G vo . But the 
distribution /„„ — > / Vo (l) is given in terms of tvK Vo (f Vo ) (for all unramificd tem- 
pered 7r„ ) by integration against Plancherel's measure, which is continuous in the 
obvious sense. Similarly, the left-hand side of (1.6) defines a discrete measure (on 
this same unramified dual). Thus it follows from (the uniqueness part of) the Riesz 
representation theorem for measures that both sides of (1.6) must be zero. □ 

Corollary (of Proposition 1.3 and the Proof of Theorem 1.1). For match- 
ing f and f on G(A) and G'(A), 

tvR (f)=tvR' (f), 

where R' is the representation ofG'(A) in the subspace of L 2 (Z(A)G'(F)\G'(A)) 
orthogonal to all one- dimensional invariant subspaces. 



It remains now to complete the: 
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Proof of Proposition 1.3. Because \Sd\ > 2, our assumption on / (that it matches 
/') implies that f v has vanishing (regular) hyperbolic orbital integrals for at least 
two places v\ and V2- Thus by Proposition 2.1 of the last lecture (Lecture V) 

(1.7) tri? (/) = m(Z(A)G(F)\G(A))f(l) 

+ V ™(Z(A)G 7 (F)\G 7 (A) f f{x- l ~fx)dx 

JG y (A)\G y (A) 

7 regular 
elliptic 

- ]T n(f)m(Z(A)G(F)\G(A)). 



On the other hand, for G', the trace formula for compact quotient in Lecture I 
implies 

trR' (f') = m(Z(A)G'(F)\G(A))f'(l) 

+ V m(Z(A)G'JF)\G'(A) f /'(arV) 

r- -,,„^,/„M JG'XA)\G'(A) 



{~/}tnZ>(F)\G>(F) 



]T »{f)m{Z{A)G'{F)\G' 



J, 2 =l 



So to prove the proposition, it remains to check that: (i) the regular elliptic orbital 
integral terms match up on G and G'; and (ii) fi(f) = /z(/') for each character /x. 

As for (i), let {L} run through a set of representatives for the classes of quadratic 
extensions L of F which don't split at any v € Sd- Then each integral 



m(Z(A)G 7 (F)\G(A)) f fix^^dx 

JG y (A)\G(A) 

in (1.7) is of the form 

(*) m(A*L x \L x (A)) f f(g" 1 19)dg 

Jl"(a)\g(a) 

for some 7 7^ 1 in F X \L X . (Note that if 7 belongs to a quadratic extension L which 
splits at some v e Sd, then our hypothesis on / implies 



/ f{9 v 1 19v)dg v = Q, 

JTt...\G„ 



i.e., the orbital integrals for such 7 don't appear in the expression for tri? (/)-) On 
the other hand, each regular orbital integral 

m(Z(A)G'JA)\G'JA)) [ f'ix^^dx 

JG' y (A)\G'(A) 

appearing in the expression for ti R' a (f) is also of the form (*), since quadratic L 
which don't split at v £ Sd are precisely the quadratic extensions of F embeddable 
in D. Thus the regular elliptic orbital integrals indeed match up term by term. 
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As for (ii), let us verify that 

M/«) = M/i) 

for each place v of F. For v ^ Sd this is a tautology. So let us assume ueSfl. By 
Weyl's integration formula for Z' V \G' V we compute 



M/i) - / fv(g)fi v (detg)dg 
Jz v \G' v 

= V i / tf(t)/i(dett)( / f' v {9-hg)dg) dt v 

Jz v \t' t k Jt' t \g'„ ' 



{Lv} 

where {it,} indexes the Cartan subgroup , and 

aia 2 

if i has eigenvalues a\ and 02- But because /„ matches f' v , this last sum equals 
V \ / S(t)fi(dett)( fv(g~ 1 tg) dg) dt v = fv{g)Vv{g)dg 

,t 1 JZ V \T L „ V7V \G„ ' JZv\Gv 



{L v } 



(since the orbital integrals of /„ vanish off the elliptic Cartans). Thus [i v {f' v ) = 
n(f v ), as required, and the proof of Proposition 1.3 (and hence Theorem 1.1) is 
complete. □ 



2. The Jacquet Langlands Correspondence. Let Aq(G) denote the collection 
of irreducible invariant subspaces V n of Ro(g) in Lq(Z(A)G(F)\G(A)). According 
to the "multiplicity one" result of [JL] , proved only using the theory of Whittaker 
models, each irreducible unitary representation 

of G(A) is realizable at most once in L\. Thus the (distinct) spaces V v in Ao(G) 
may be confused with the set of (classes of) irreducible automorphic cuspidal repre- 
sentations of GL(2). Moreover, "strong multiplicity one" (proved using "only" the 
theory of L-functions) implies that V n — V^' as soon as tt v and w' v are equivalent 
for almost all v. 

Now consider the group G' — D x , and the collection Ao(G') of irreducibly invari- 
ant subspaces of R' (g) acting in the subspacc of L 2 (Z(A)G' (F)\G(A)) orthogonal 
to all the one-dimensional invariant subspaces. Then the trace formula implies 
that "multiplicity one" and "strong multiplicity one" hold in Ao(G') as well (even 
though Whittaker models disappear in this setting, and the theory of L-functions 
cannot be developed a la Hecke ... ). Indeed, this is a corollary of the following 
Jacquet-Langlands correspondence between G' and G: 
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Theorem 2.1. There exists a 1-1 mapping 

W — K 

/rom ^4o(G') to Ao(G), with the property that ir v = n' v for all v ^ Sd; the image 
consists of all those n in Ao(G) with the property that ir v is square-integrable 
(mod Z) for all ueSfl. 

Remark. A weaker form of this correspondence asserts the existence of a map 
between the representations n' and it above; i.e., to each it' on G' realizable in 
some G' on Ao(G) there exists a ir in Ao(G) such that ir' v = w v for all v ^ Sd- As 
noted in Lecture I, this can be proved using L-functions but the argument does not 
yield a characterization of the image, nor the multiplicity one results for G' . 
The trace formula proof of Theorem 2.1 starts from the basic identity 

tvR (f)=trR' (f') 

valid for matching / and /'; see the Corollary to the proof of Theorem 1.1 in the 
preceding section. Equivalently, and more suggestively, 

(2-2) ]T tm(f)= ]T tr^'Cf). 

ireA a (G) TT'eA' (G) 

To extract the desired bijection between Vv and tt from the identity (2.2) we need 
to refine this identity to one of the following type: 

Fix any finite set S of places of F containing Sd and the archimedean places, 
and for each v ^ S, fix a given representation 7r" which is unramified; then 

(2.3) En tr ^)=Eii tr <^)< 

ves v^, ves 

with f v <~ f' v , and the sums taken over all V n in Ao(G) (rcsp. W in Ao(G)) such 
that w v (resp. ir' v ) is isomorphic to 7r° for all v ^ S. Note that by strong multiplicity 
one for GL(2), the left-hand side of (2.3) contains at most one term. 

Assuming the truth of (2.3), let us explain the slick fashion in which Theorem 2.1 
can be proved. Suppose first that no V n in Aq(G) corresponds to a given V n * in 
A' (G'). Then (2.3) (with S the set of places outside of which D and tt' v are p-adic 
and unramified, and 7r° = ir' v for v ^ S) implies that the left-hand side of (2.3) is 
zero. Thus we also have 

(2-4) £ tr7r sUs) = °> 

where 

^5 = ®X, f's=U & 

ves ves 

is arbitrary in C%°(Zs(A)\G' s (A)), and the sum is over all elements of A' (G') (like, 
for example, TV) such that (ir) v = 7r° for v S. But (2.4) contradicts a well-known 
result of local harmonic analysis known as "linear independence of characters" (see 
Lemma 16.1.1 of [JL] or Lemma 5.11 of [Rol]). 
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Similarly, we can begin to characterize the image of this correspondence V n > — ► n. 
Indeed, suppose n = <gnr v is in the image of this correspondence, but ir v is not 
square- integrable for some vo in Sd- Then it follows 

tTTT Vo (f Vo ) = 0, 

since n Vo would then be induced, and f Vo <~ f implies that f Vo has vanishing 
hyperbolic integrals. Thus the left-hand side of (2.3) would vanish, leading to the 
same contradiction as above. 

To complete the characterization of the image, we need to recall some facts 
about the square-integrable representations of G v and G' v . In particular, let ( , ) e 
denote the inner product for the space of class functions on the set of regular elliptic 
elements of Z V \G V or Z V \G' V defined by 

~ JZ V \TI" S 

the sum extending over the conjugacy classes of compact tori of G v (conveniently 
confused with those of G' v ). For G' v , the Peter- Weyl theorem and the Weyl integra- 
tion formula imply that the characters of the irreducible representations of Z V \G' V 
comprise a complete orthonormal set with respect to ( , ) e . On the other hand, 
for G v the so-called "orthogonality relations for square integrable representations" 
imply that the characters of the square-integrable irreducible representations of 
Z V \G V comprise at least an orthonormal set with respect to ( , ) e Thus, given 
any square-integrable representation tt v on G v , v £ Sd, we can determine an irre- 
ducible 7r" on G' v by the condition 

we also fix f' v in (Z V \G' V ) such that 

tTTr'(f') = ( 1 if W ' v ~ 7r "' 
v v \ otherwise. 

Namely, we take f' v = x K >- 

So suppose now that ir in Aq(G) is such that ir v is square-integrable for each 
ueSfl. Then the trace formula identity (2.3) implies that 

(2.5) n°'« = E 1 ' 

ves v„, 

with the sum taken over V n > in A' (G) such that ir' v ~ 7r" for v in 5, and n' v ~ ir v 
for v $l S. Indeed, for neS, 

tr7r„(/„) = / f(g)Xn v {9)d9 

JZ v \Gv 

{n} Jz v \Tl** 

= Ei/ 6(t) X , v mMdt 

(Xn v > Xir'j)e a TTv7 

with 

M.Q = I Mg-hg) d 9 = f fUg-'tg) dg = $ t (/J). 

Jt v \g v Jt v \c v 
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Claim. The sum on the right side of (2.5) has exactly one term in it. 

Indeed, if there were none, the left-hand side of (2.5) would be zero (which is 
impossible by our assumption in a %v , whereas as if there were two or more terms, 
we would have 

(again impossible, since each |a w J = |(X7r„,X<)e| < ||XnJ|e||x<l|e < 1)- Thus we 
conclude there must be exactly one V w > in A' (G') corresponding to it in Ao(G), 
and the theorem is finally proved. 

2. Embedding of Local Discrete Series in Cusp Forms. 

Theorem 2.1. Let V be any finite number of finite places of F , and w v a dis- 
crete series representation of Z V \G V for each v in V. Then there exists a cuspidal 
automorphic representation 7r of G(A) such that tt v = (tt) v for each v in V . 

Proof. We can assume that for at least one place v\ <G V, ir Vl is actually supercusp- 
idal (by adding such a pair (vi,ir Vl ) into the collection {v, n v } if necessary). Before 
applying the simple trace formula of Delignc-Kazhdan to this situation, we need to 
recall two important facts about local harmonic analysis on G v . 

For G v non-archimedean, a function f v in C^°(Z V \G V ) is called a pseudo- 
matrix coefficient for the discrete series representation tt v of Z V \G V if tr ir v (f v ) = 
1, but tvr v (f v ) = for any tempered irreducible admissible representation t v not 
equivalent to ir v . That such pseudo-matrix coefficients exist in the generality of a 
general reductive G is a theorem of Bernstein, Deligne and Kazhdan (see [BDK] , 
[Clo] [BDKV]). Moreover, one knows that for such functions /„ and 7 (regular) 
elliptic in G v , 

®t(fv) = / fvix^jx) dx = cx^~(7) 

JTv\Gv 

for a non-zero constant c. In particular, the orbital integral Q^ifv) is not identically 
zero (since Xtt„ ^ on the regular elliptic elements; see [Clo]). 

Remark. Suppose tt v is supercuspidal, and we denote one of its matrix coefficients 
suitably normalized by f v (g). Then /„ belongs to C£°(Z V \G V ), 

tr7r„(/„) = l, and 
trT„(/„) = 

for any irreducible admissible t v not equivalent to ir v . Thus the terminology pseudo- 
matrix coefficient is indeed apt. 

Now given V and {ir v } as in the hypothesis of Theorem 2.1, fix 

f = Hf v mC™(Z(A)\G(A)) 

V 

such that (a) for each v G V, f v is a pseudo-matrix coefficient; and (b) for a fixed 
finite place V2 outside V, f V2 is supported on the regular elliptic set of G V2 . Then 
by the simple trace formula of Deligne-Kazhdan, for such /, 

(2.2) ^tr7r(/) = ^m 7 $ 7 (/), 

7T 7 
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where the sum on the left is over the cuspidal automorphic tt on Z(A)\G(A) such 
that (tt) v = tt v for each v e V, and the right-hand side is summed over the regular 
elliptic conjugacy classes in Z(F)\G(F). Thus to prove Theorem 2.1, it suffices to 
prove that this right-hand side (and hence the left-hand side) does not vanish for a 
particular special choice of such f. 
Equivalently, write 

*7(/)=n $ 7(M 

v 

a product of local orbital integrals. It will suffice then to choose / such that 
$ 7 (/) 7^ for a particular elliptic regular 70, but is zero for all other 7. 

To this end, first choose 70 and f V2 (still assumed to have support in the regular 
elliptic set) such that 

$70 (fv) f° r cacn v inVU {v 2 }. 

(Recall that for each pseudo-matrix coefficient, $ 7 (/„) = cx-K V (l) is n °t identically 
zero on the elliptic regular set of G v ... .) This particular choice of 70 will intersect 
the maximal compact K v of G v for almost all finite places outside VU {«2}- Let us 
denote this (infinite) set of places by V* , and for each v in V* set f v equal to the 
unit element of H v = H(G V , K v ). Finally, choose f v at the remaining finite places 
(outside V U {v 2 } U V*) to be such that $ 7o (/„) ^ 0. 

With this refined choice of / = f\ fv (still arbitrary at the infinite places!) it is 
clear that 

(2.3) * 7 (/)^0 

for some elliptic regular 7 only if the coefficients of the characteristic polynomial 
of 7 are v-integral for each v in V*, i.e. are rational with uniformly bounded 
denominator. But such rational numbers lie in a lattice of F. Thus we can choose 
f v at the archimedean places to have support so small near 70 as to intersect the 
conjugacy class of no other 7 satisfying (2.3). With this choice of / = \\f v , 
$ 7 (/) ^ if and only if 7 = 70, and the proof is complete. □ 

Concluding Remark. Crucial to our argument was the fact that the pseudo- 
matrix coefficients /„, v G V, eliminated from (2.2) all cuspidal t = ®t„ with 
t v not equivalent to tt v for v in V. This worked in our case since the only non- 
tempered t v which might satisfy tvT v (f v ) ^ is the trivial representation (when 
tt v is the Steinberg representation), and such a representation can never occur as 
a local component of a cusp form on GL(2). In general, an additional argument 
is necessary, involving some kind of limit- multiplicity argument (due to de George- 
Wallach); see [Rol] or Appendix 3 of [BDKV] . 
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Lecture VII. (G, M)-Families and the Spectral J x (/) 



The transition from Arthur's first ("coarse") form of the trace formula to a more 
explicit ( "fine" ) expansion results from formal operations on collections of functions 
called (G, M)-families, operations which pervade the "fine o-expansion" as well as 
the "fine x-expansion" . 

The purpose of this lecture is to describe some basic examples and properties 
of (G, M)-families, and to explain the pivotal role they play, especially in making 
explicit the spectral ramified terms J x (f). 

1. Definitions and Basic Examples. Fix a Levi subgroup M of G, and let V(M) 
denote the set of not necessarily standard parabolic subgroups of G for which M is 
the Levi component. 

Definition 1.1. 

Suppose that for each P in V(M), cp(A) is a smooth function on ia* M . Then the 
collection 

{c P (A): PeV(M)} 

is called a (G, M)-family if the following condition holds: if P and P' are adjacent 
groups in V(M), and A belongs to the hyperplane spanned by the common wall of 
the chambers of P and P' , then 

(*) cp(A) = Cp ,(A). 

Remark 1.2. The compatibility condition (*) here is equivalent to the property 
that whenever P and P' are elements of V(M) contained in a given parabolic 
subgroup Q, and A belongs to idg, then 

Cp (A) = cp,(A). 

In particular, let 

T{M) 

denote the parabolic subgroups of G whose Levi components contain M; then for 
each Q in T{M) one may define a smooth function cq(A) on zOq C ia* P through 
the formula 

cq (A) = cp (A), 
for any parabolic P in V(M) contained in Q. 
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Example 1.3. Let {Xp: P E "P(M)} be a collection of points in om with the 
property that for each pair (P,P') of adjacent groups in V(M), Xp — Xp> is per- 
pendicular to the hyperplane spanned by the common wall of the chambers of P 
and P'\ pictorially: 







Xp 









Figure 3 
Then the resulting family of functions 



c P (A) 



A(X P ) 



on ia* M is a (G, M) -family. Indeed for A on such a hyperplane, 



cp (A) 
cp' (A) 



= e A(X P -X pl ) = L 



Remark 1.4. The similarity here with Figure 2 of Lecture IV is not coincidental. 
Indeed, for any fixed x in G(A), the points 

{-H P (x): PeV(M)} 

comprise an "orthogonal family" of points as depicted in the figure above; this 
is proved in Lemma 3.6 of [A8], where such "AM-orthogonal" families of points 
were first introduced. (N.B. A collection of points {Xp} in Om comprises an Au~ 
orthogonal family if an only if Xp — Xpi is a multiple of the coroot associated to 
the unique root in Ap D — A pi).) 

Remark 1.5. If L is any Levi subgroup of G containing M, the more general 
notion of an (L, M)-family is defined exactly as above: for each P in V L (M) (the 
parabolic subgroups of L with Levi component M), cp(A) is smooth on ia* M , and 
if P' in V L (M) is adjacent to P, then cp and epi agree on the wall between the 
corresponding adjacent chambers. 

For the next Lemma, we need to recall a function Op (A) defined on ia* M by 

MA) = (apr 1 II A (" V )' 

aeAp 

where ap is the covolume of a certain coroot lattice. (Again, there is the more 
general notion of 8p for any parabolic Q D P ) 
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Lemma 1.6. For any (G, M)-family {c P (A) : P e V(M)}, the function 

c M (A)= ]T cp(A)0 p (A)- 1 , 
Pev(M) 

initially defined away from the hyperplanes A(a v ) = 0, extends to a smooth function 
on ia* M . 

Proof. The only possible singularities along A(a v ) = occur in terms correspond- 
ing to those P for which a or —a is a simple root. But such groups occur in pairs 
of adjacent P, P', where cp and cp> agree. I.e., the contribution of these P's to 
c M (A) is 

cp(A) c P .{A) 

MA) MA)' 

where the numerators are equal, and the denominators appear as negatives of one 
another; thus the singularities cancel. □ 

Remark. The functions cm (A) arise naturally as the weight functions of the 
weighted orbital integrals described in Lecture IV. Indeed, consider the (G, M)- 
family 

{cp(A)^e^ x ^;Xp = -H P (x)} 
recalled above. In [A8] it was shown that the function 

c M (A)= ]T e^MA)- 1 
PeV(M) 

equals the Fourier transform of the characteristic function of the convex hull of the 
points Xp (pictured in Figure 3 above). Thus it follows (without appealing to the 
more general Lemma 1.6 above) that cm (A) is a smooth function on ia* M (being 
the Fourier transform of a compactly supported function). In particular, cm(0) 
is defined, and equal to the volume of this convex hull, namely the weight 
function vm(x). 

It turns out that (G, M)-families pervade even more extensively the spectral 
terms J x (f)- Here, as we shall explain below, (G,M) families arise which are 
products of families of the above (geometric) type, with (G, M)-families defined 
in terms of intertwining operators. 



2. Motivation and Examples Arising from the Spectral Distributions 

jj(/). Recall (from Lecture III) that 

4(f) = I A^K x (x,x)dx, 

J Z(A)G(F)\G(A) 

with 
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Here P = MpNp runs over the "associated" parabolics of G, <f> runs thru a if -finite 
basis for p(a, A), and a runs through (classes of) irreducible unitary representations 
of Mp(A) such that functions in Indp^j a belong to L 2 (P) X . 
Formally then, 



D „ Jia*/ia% 



where 



and 



(fi^ (P,A)0) = / A T £(x>',A)A T P(x,0,A)^- 

Jz(A)G(F)\G{A) 

But according to the inner product formula for truncated Eisenstein series, this 
operator fi£ (P, A) equals the value at A' = A of 

(t'V-tA)(T) 

(2.D E E M ft|P (t,A)-^, P y,v) ^ (t>A/ _ tA) . 

Fl i,t' in 

Here Mp 1 | P (f, A) is the intertwining operator defined on Ind(tr, A) by 
/ 4 > {w^ 1 nx) dx 

JN Pi (A)nw t N P (A)w- 1 \N Pl (A) 

(initially for A with large real part, then for any A by analytic continuation). Denote 
by w^ i(7 (P, A) the value of the operator defined in (2.1) at A' = A. Actually, for 
P ^ P x , the operators Cl x t7 (P, A) and u^ jCr (P, A) are only asymptotically equal 
(for large T); this is the thrust of [A9]. 

Claim. 

(2-2) tv(^ a (P,X)p(a,X)(f)) 

may be evaluated by setting A' = A in the sum over s € W(dp, dp) of 

tr( ^ c Q (A)d Q (A)e Q (A)- 1 p(a,X)(f)), 

QeV(M) 

where A = A s = sA' — A, and cq and dQ are two (G, M) families to be described 
below. 

To see this, rewrite the expression (2.1) as 

^ ^ ^ , e (t(aV-A))(T) 

E E E M fl|P M)-M Pl|f ( b ,y) . 

W(ap.ap) 
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For any Pi and t e W(ap, apj, set Q = w^ 1 Piw t for any representative w t of t in 
G(F). Then 

(P u t)~Q 

is a bijcction between pairs which occurs in the sum above and groups Q e V{M). 
It can also be checked that 8p 1 (t(s\' — A)) = Oq(s\' — A); moreover, 

M Pl |p(i ) A)- 1 M Pl | P (i S ,AOeW sA '- A ))( T )=M Q |p(A)- 1 M Q | P ( S ,A')e^'- A )^( T ) ) 

where Mq| P (A) = Mq| P (1, A), and 

Y Q (T)=t~ 1 T 

(or rather the projection of this point onto <Xm)- Thus the expression (2.1) indeed 
equals the value at A' = A of the sum over s in W(ap, op) of 

c Q (A)d Q (A)6 Q (A)-i 

QeV(M) 



with 



(2.3) cq(A) 



e 



A(>q(T)) 



A = sX' - A, 

and 

(2-4) do (A) = (M Q |p(A)- 1 M Q |p( s , A')). 

Remark 2.5. Below we shall see that {cq(A)} and {dg(A)} both define (G, M)- 
families. Thus the product 

{e(A) = c(A)d(A)} 
is also a (G, M)-family, and the corresponding function 

e M (A)= ^ c Q (A)d Q (A)0Q(A)- 1 

QeV(M) 

is smooth on io^-. In particular, the expression (2.2) may indeed be evaluated at 
A' = A (which corresponds to evaluating eu (A) at A = sX — A) . 

Remark 2.6. In (2.3) and (2.4), set A' = ( + A in ia* M , and s = the identity. 
Then the value of ejw(A) at A' = A corresponds to the (operator) expression 



(Yo(T)) M Q\P( X )~ lM Q\P( X ') 



val V e 

A'=A ^ On(0 



i irn V V ^cm Mjw^y'Mjw^ + Q 
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This is precisely the operator we denoted by Mf(a) x in Proposition 3.9 of Lec- 
ture IV. Thus the "weight function" Mp(a) x appearing in the weighted character 
expression for the unramified spectral term Jj(/) in Proposition 3.9 is indeed 
realizable as a special value of eAf(A) for an appropriate (G, M)-family ep. 

N.B. As we observed in the case of GL(2) (in §2 of Lecture IV), the expression 
for J x (f) simplifies considerably for unramified spectral data x, since the sum 
over t, t' in W(op,Op 1 ) in (2.1) need only be taken over t = t', i.e., only the term 
corresponding to s = 1 really appears in the expression for (2.2), and we indeed 
have 

J xif) = E E ^7X1 / tv{Ml{a) x p{a, A)) dX 
as asserted in Proposition 3.9 of Lecture IV. 

Proposition 2.7. Both cq(A) and (1q(A) define (G, M)- families. 
Proof. We have to show that if Q and Q' are adjacent in V(M), then 

Y Q {T)-Y Q ,{T) 

is perpendicular to the hyperplane in ia* M spanned by the wall common to the 
chambers of Q and Q'. Recall that for any Q in V(M), Y Q {T) = t^T if Q = 
w^ 1 P\Wt for some fixed P\ € V(M). But if Q' is adjacent to Q, then Q' — w^, l P\Wt' 
with t' — s a t, and s a the simple reflection corresponding to an a e Ap . Therefore 
Yq(T) - Yq> (T) = t- x {T - s^T), which is a multiple of t' 1 ^ (sec footnote 1 
below). But s _1 a v = /3 V , where /3 V is the unique root in Aq n (— Aq/). Thus 
Yq(T) — Yqi{T) is perpendicular to the hyperplane between Q and Q' . □ 

As for cZq(A), we recall the standard functional equation 

M Q , lP (s, A) = M Q , {Q (1, s\)M Q , lP (s, A), 

valid for s G VK(aM, fljw) and Q € V(M) (see equation (1.2) of [A8]). Using it gives 

d Q ,(A)^M QllP (X)- 1 M Q , lP ( s ,\ / ) 

= Af Q |p(A)- 1 Af Q ,| Q (A)- 1 M Q1Q ( s A')M Q |p( s , A'). 

So it remains to show that for A belonging to the hyperplane common to Q' and Q, 

M QI]Q {X)- l M Q , ]Q {sX') = I. 

For this, sec §§1 and 2 of [A8]. 



formally, a v = 2a/(a,a), and s^ 1 = T - 2(T, a)/(a, a) a. Therefore, for any 0, (J3,T - 
s^iT)) = (P,T)-(/3,T)+2(T,a)/(a,a) (P,a) = (T, o)(/3, o v ), i.e., T-s^T is indeed a multiple 
of a v . 
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3. A Splitting Formula for Products of (G, Af )-families. In the last section, 
we explained how J%(f) is expressed in terms of integrals of special values of the 
functions eM(A), where ep = cp(A)dp(A) is the product of two (G, M)-families. 
The result below is crucial in completing the evaluation of these integrals. 

Proposition 3.1 (see Lemma 6.3 of [A4]). In general, 

(cd) M (A)= c mW q (A), 
Qer(M) 

where T(M) denotes the set of parabolics of G whose Levi component contains (but 
does not necessarily equal) M. 

We will not prove this proposition, since the reader can find it in [A4]. But 
we shall at least define the terms and d'g appearing here, and explain their 
significance. 

Given any (G, M) -family {cp(A)}, and a parabolic Q containing some P in 
V(M), how first do we define the function cq? 
Since io*q C ia* P , it is tempting to simply set 

c Q (A) = c p (A q ) 

with Aq the projection of A onto iOg. But is this well-defined? If P' is another 
element of V(M) contained in Q, we must check that 

c P /(A) = cp(A) 

as soon as A e ia*Q. But we have already remarked that the compatibility condition 
on the (G, M)-family {cp} implies this is the case (Exercise!), and thus cq is well- 
defined on ia*Q. 

Now for any pair of parabolics Q C i?, define 

^(A) = (a«)- 1 J] A (* V )> 

with w v in Oq defined by 

(and the constant a,Q the covolume of a certain lattice ... ) . Then the functions 

c' Q (A), QdP 

are defined (initially on the complement of a finite set of hyperplanes in ia* P ) by 
c' Q (A) = Yl (-l) d[m{AQlAR) ^(A)- 1 c R (A)9 R (A)-\ 

{R: RDQ} 

In Lemma 6.1 of [A4], it is shown that these functions extend to smooth functions 
on all of iap. 
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Now that the functions d'g in Proposition 3.1 are defined, there remains the 
simpler task of defining the c M (A); these are just the cm functions (1.5) derived 
from the following (L, M)-families of functions {c^(A)}. 

Fix Q arbitrary in T{M), and let L — Lq denote its Levi component. By V L {M) 
we denote the set of parabolic subgroups of L whose Levi component is M, and 
for each R £ V L (M), we set Q(R) equal to the unique group in V(M) which is 
contained in Q and such that Q(R) flL = R. Then the resulting family of functions 

{4(A) = c Q{R) (A):R£V L (M)} 

comprises an (L, M)-family, and the functions c M (A) appearing in Proposition 3.1 
are the corresponding CM-functions 

4(A) = E c%{A)0%{A)-\ 

R£V L (M) 



4. GL(2) Revisited. Although we already discussed (in Lecture IV) the evaluation 
of Jj(/) in this case, we wish to return to this calculation now from the more general 
standpoint of Arthur's theory. 

According to the discussion in Section 2 above, Jj(/) is given by the polynomial 

(4.1) W EE {A = Vtr( eM (AHa,A)(/))}dA, 

where e(A) is the (G, M)-family given by the product of the family of functions 
(2.3) and (2.4). When P = B (the Borel subgroup), we have 

fit —it\ 

and 

. _ fit' -it' 

with t' = t + ( and A = s\' - A. 

For x = {(M, /j,)}, the contribution from the term s = 1 reduces to 
(4.2) 

f rflimf 6 ^ 1 "^ 72 e'' C(Tl ' T2)/2 M(w, it^Mjw, i(t + Q) | 

Indeed, the two elements of V(M) are just B and B (the subgroup wBw -1 opposite 
to B), and so 

e M (A) = e B {A)e B (Ay 1 + e s (A)^(A)- 1 
_ c B (A)d B (A) c B (A)d B (A) 
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But (2.3) clearly implies that 

c B (A) = e A ^ = e «m-ib)/2 

and 

c s (A)-e- A ( T ) = e -*( r i- r ')/ 2 . 

Also Ms|b(1,A) = /, so c?b(A) = /, and it remains only to compute d§(A). For 
this, let T w denote the linear transformation <j>(x) — ► <j>(wx) taking Indg a w \ \ wX 
to Inds a | | A . Then it is straightforward to check that 

ds(A) = Ms lB (X)- 1 Ms lB (l,X') 

= (T„Af( W ,A))- 1 (r tu M( U ;,A')) 
= M(tu, it)~ 1 M(w, i(t + 1')), 

and thus (4.2) holds. To continue, there is no need to appeal to the fact that the 
expression in brackets in (4.2) comes from the (G, M) -family {ep}. Instead, one 
can just compute the limit in question to be 

(T 1 -T 2 )+M(-it)M'(it), 

and hence the contribution to Jj(/) (from s = 1) to be 

/OO POO 
tr(p(/i, **)(/) dt+ / tT{M(-it)M'(it)p(n,it)(f)dt. 
=oo J — oo 

This of course agrees with formula (2.5) of Lecture IV, and for \ unramified, it 
accounts for all of J%(f)- 

Now let us concentrate on the more interesting contribution to (4.1), namely the 
one corresponding to s = w. In this case, A = w\' — A, and this equals (—it, it) 
when A' = A. So as above, we compute 

c B (A) - e A ^ = e -»*(Ti-T a ) 
c B (A) = e- A ^ = e **(Ti-T a ) 
d B (A) = M(w,X') 

and 

d s (A) = M(w, A) -1 . 

In other words, for s = w 

|A /V =Atr(eM(A)p(<7, A)(/)| 

e 2tt(Ti-T 2 ) e -2it(Ti-T 2 ) 

= tr(M(w,-ii)(p(/Mi)(/)) gft tr(M(«;,it)(p(/i, «)(/)) — . 

This is exactly the expression (6.28) in [GJ]. But instead of evaluating its integral 
over R using the simple manipulations of [G J] , let us instead appeal to the splitting 
formula for ejw(A). 
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Lemma. For s = w 

e A(T) _ g— A(T) 



') 


-2it 


d B (A) 


-du(0) 




2ii 


d fl (A) 


-d B (0) 



+ 

+ 2it 
with e ±A ( T ) =e=F«*(ri-T ! ,) i 

Proof. According to Proposition 3.1, we need to compute 

c Q M {K)d' Q {K) 

for each Q e T{M). So first fix Q = G. By definition, d G (A) = d B (0) = d s (0), 
and then d' G (A) = do(A). On the other hand, we compute (from the definition of 
c M ) that c M (A) = cm (A). Thus the leading term of (cd)M(A) is indeed 



MO) 



e A(T) _ e -A(T) 



as claimed. 

Now fix Q = B. Then 



4(0) = ^(A)-d s (0) 



-lit 

while c M (A) = c s (A). Indeed, in this case L — M and V L (M) = M\ Similarly 
4(A)d^(A) = ^y^ .^(A), 

and we're done. □ 

Proposition. For T large, the contribution to J£(f) from (the Borel subgroup B 
and) s = w is just 

tr(d s (0)p(0, M )(/)) = tr(M(0, n)p(0, M )(.f )); 

in particular, only the leading term in (cc?)m(A) contributes to J%(f)- 
Proof. 

Let us suppose for the moment that J£{f) can be calculated by taking a com- 
pactly supported Schwartz function B(t) with B(0) — 1, and then computing 

/oo 
tr(wT(t)p(»,it)(f))B(et)dt. 
-oo 

By the last lemma, the contribution from s = w (and P = P) to 
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is 

e 2it(Ti-T 2 ) _ e -2it(Ti-T 2 ) 



lit 



ix(d B (0)p{n,it)(f)) 



i -2it(T 1 -T 2 ) 

tr((d B (A)-d B (0))p(Ai, «)(/)) 
+ 6 -2it 



, 2it(Ti-T 2 ) tr((d s (A)-d B (0))p(p, **)(/)) 
+ 6 -2it 



But for any real T, 



+ — = J e iyx dx 



w -iy J-t 

pOO 

= / XrWe-** dx, 

with xt the characteristic function of [— T, T] . Thus we can write the contribution 
of the leading term as 

lim lim (/ Xa (T)(x) [ e 2Ux tr(d B (0)p(»,it)(f))B(et) dt dx \ 

= lim lim (/ Xa (T){x)F £ {x) dx\ 

= lim ( / F e (a:) dxX = lim F e (0), 
yj-oo J 



where 



F e (t)=tr(d B (0)p(/i,it)(/))B(et) 

= tr(M(0,M)p(M,0)(/)) att = 0, 

since -B(O) = 1. On the other hand, for the second (or third) term, the contribution 
is 

/oo 
e T2lt{Tl ~ T2) FJt) dt 
-oo 

where for each e, 

Fe{t)=t JMt)-d B (o) P (,,it)(m B{ ^ 

I zt2z£ J 

is a smooth Schwartz function on R, whose Fourier transform vanishes at infinity. 
Thus the second and third terms contribute nothing to □ 

Concluding Remarks. (1) In treating the GL(2) calculation of J^(f) from the 
point of view of Arthur's general theory, we have made a big mountain out of 
a small mole hill. However, already for GL(3), there are compelling reasons for 
appealing to the general theory — not just for using the splitting formula for products 
of (G, M)-families, but also for exploiting the compactly supported function B in 
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the calculation of J x (f). Indeed, without using this function B, there is the need to 
directly establish the integrability of a function like ti(M(w, A)p(er, A)(/)) (in order 
to justify applying a Riemann-Lebesgue Lemma as T — > oo). In the case of GL(2), 
this can be checked directly but already for GL(3) the required estimates on the 
growth of M(w, A) are non-trivial (see Section 3 of [Jal]), and in general they are 
simply not available. 

(2) It remains to prove that the above computation (with the insertion of the 
"smoothing function" B(t)) is justified. This is precisely the goal of Arthur's paper 
[A 7] and — we hope — part of the subject matter of Lecture IX. 

(New) Reference 

[Jal] Jacquct, H., The continuous spectrum of the relative trace formula for GL(3) over a 
quadratic extension, Israel Journal of Math. 89, 1-59. 
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Lecture VIII. Jacquet's Relative Trace Formula 



In deriving Arthur's trace formula 

(*) E J o T (/) = E^ T (A 

o X 

we started by restricting the kernel functions 

o X 

to the diagonal subgroup. 

Z(A)G(F)\G(A) x Z(A)G(F)\G(A). 

Then after modifying these restricted kernels, we were able to integrate them and 
obtain the formula (*). The term "trace formula" seemed apt because — at least for 
cuspidal data \ — {{^i r )} with M = G, f£{f) actually represents the trace of 
R(f) restricted to L^. 

The idea of Jacquet's relative "trace formula" is to restrict the kernel functions 
Kf(x, y) to interesting subgroups of the diagonal, and then integrate them against 
possibly non-trivial automorphic forms on these subgroups. Although the re- 
sulting integrals no longer represent the "trace" of anything, there results from this 
new approach a dramatic array of interesting possible applications to automorphic 
forms (See [JLR] and [Jal] for a general discussion, and further references.) 

In this lecture we shall concentrate on a particular example (cf. [Ye]) which 
already shows the power of this method, and indicates how the further development 
of the method is inextricably linked to the general theory developed by Arthur. 
Even in our exposition of this simplest example, we are influenced by Jacquet's 
more general ideas (see [Jal]); moreover, the approach to the computations in §4 
is adapted from joint work with him and Rogawski on similar computations for 
GL(3). 



1. Description of a Base Change Theorem. Fix a quadratic extension E of 
the number field F, and let u>e/f denote the corresponding quadratic character of 
F*\Ap. Let us call a cuspidal automorphic representation tt of GL 2 (A^) stable if 
it is not of the form tt(^i') for any grossencharacter fi' of E. 
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Theorem (cf. [Ye]). Every stable cuspidal tt on GL 2 (Aj?) 7 with central charac- 
ter loe/Fj has a base change lift to a cuspidal representation tte of GL 2 (A,e); 
moreover, the resulting map 

7T ► TTE 

is a bijection onto the cuspidal representations II of GL 2 (A e) which (have trivial 
central character and ) are distinguished with respect to GL 2 (F) 7 i.e., the "period" 

(1.2) / <(>'(h)dh 

Jz(A) GL 2 (F)\GL 2 (A F ) 

is non-zero for some <f>' in the space ofU. 

Remarks, (i) In [La2], Langlands established the existence of a base change lift 
for GL 2 for an arbitrary cyclic extension E of F, and in[AC] these results were 
dramatically generalized to GL„. Both these works characterize the image of base 
change in terms of the Galois invariance of the representations "upstairs" ; this is 
not surprising, since the proofs proceed from a comparison of the ("invariant") 
trace formula over F with the (Galois) "twisted" ("invariant") trace formula over 
E. 

(ii) In [HLR] it is shown that an automorphic cuspidal representation II of GL 2 (E) 
(trivial on its center) is the base change lift of an automorphic cuspidal representa- 
tion 7r of GL 2 (A;?) (with central character uj e /f) if an d only if II is distinguished. 
The argument there combines the base change result of [La2] with properties of 
the so-called Asai L-function L(s,U,Asai) (which has a pole at s = 1 if and only 
if 7T is distinguished ... ). (By contrast, the relative trace formula proof of Theo- 
rem 1.1 (see below) makes no appeal to [La2].) The main thrust of [HLR] was to 
study algebraic cycles on certain Shimura varieties attached to GL 2 over E, whose 
Hasse-Weil zeta functions are computable in terms of 

L(s, II, Asai). 

It was precisely the desire to extend these results to the context of quaternion 
algebras that led to the idea of the relative trace formula introduced in [JLai] . 

(iii) In [JLai], Jacquet and Lai used a comparison of relative trace formulas on 
GL 2 / E and the multiplicative group G' of a division quaternion algebra over E (the 
relative subgroups of the diagonal being the F-diagonal subgroups) to prove that II' 
on G' /E is G'(F) distinguished if and only if its Jacquet-Langlands correspondent 
II on GL 2 /E is GL 2 (F) distinguished. This was applied in [Lai2] to extend the 
results of [HLR] to the case of certain compact Shimura varieties. 



2. Outline of the Proof of Theorem 1.1. It is convenient to use the notation H 
for the group GL 2 over F, and G for the group Resf GL 2 ( so that H(A) = GL 2 (A^ ) 
and G(A) = GL 2 (A B )). By 

f=n.fv 
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we denote a test function in C^°(H(A),Wp/p), where for almost every place v, f v 
is the identity element of the local Hecke algebra H(H V , , uj v )- similarly, 

f = UK 

belongs to [Z (Ae\G(Ap)) , with f' v almost everywhere the identity in 

H(Z Gv \G v ,Kf). 
Corresponding to / there is the convolution operator 

R(f)= f f(h)R(h)dh 

JZ(A)\H(A) 

in the space L 2 (H(F)\H(A),uj); it is an integral operator with kernel 

K ti x ,y)= f( x ~ l iv)- 

7 £Z(F)\H(F) 

Similarly /' determines an integral operator R'(f') in L 2 (Zg(A)G(F)\G(A)) with 
kernel 

K f >(gi,g 2 )= f{gi 1 i92)- 

7 eZ G (F)\G(F) 

The proof of Theorem 1.1 will come from comparing certain "relative traces" asso- 
ciated to these kernels, as we shall now explain. 

Step I. Prove that for "matching" / and /', 

(2.1) // Kf(n 1 ,n 2 )ipN(ni 1 n 2 ) dn 1 dn2 

J J[N(F)\N(A F )] 2 

= [ [ K f ,(n',h)^' N ,(n')dn'dh. 

Jz(A)H(F)\H(A) Jn'(F)\N'(A f ) 



Here N (resp. N') is the standard maximal unipotent of H (resp. G), and ?/V 
(resp. ip' N ) the corresponding character of N(F)\N(Ap) (resp. N' (F)\N' (Ap)) 
determined by a fixed additive character ip (of F\A) (with ip' = ip o tv E / F ). 

Remarks, (i) A special feature of this particular relative trace formula approach is 
the absolute convergence of the integrals on either side of the identity (2.1). On 
the left hand side, this is clear from the compactness of the domains of integration; 
for the right side, this follows from the fact that for sufficiently large T, 

[ A^K r (n',h)il>' N ,{n')dn' 

JN'(F)\N'(A) 

equals (as opposed to just approximates) 

K f ,(n',h)ip' N ,(n')dn'. 



I- 
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For a proof, see Proposition 2.1 of [Jal]. 



(ii) At first sight, the equality in (2.1) seems surprising, since the double cosets 

N\H/N and N'\G/H 



which arise on each side do not match up. However, it seems to be a general feature 
of the relative trace formula that the "relevant" orbits — those which survive the rel- 
ative integrations — do indeed match up. In the case at hand, it is the integration 
against ip which allows for the matching up of relevant orbits. Once this is estab- 
lished, the assertion that (2.1) holds for matching / and /' reduces quickly to a 
collection of purely local assertions: two local function f v and f' v are "matching" 
if certain local relative orbital integrals (indexed by identical sets) are equal. 

(iii) It is also crucial to prove a "fundamental lemma" asserting that this local 
matching is compatible with the base change map BC * between the Hecke algebras 
of K v and G v . This map is dual to the natural base change morphism 



from L H = GL 2 (C) x Gal(E/F) to L G = (GL 2 (C) x GL 2 (Q) x Gal(E/F). In 
particular, when f' v belongs to the Hecke algebra of G Vl the function f v in the 
Hecke algebra of H v defined by 



will be a matching function to f' v . For proofs of these matching results, see [Ye]. For 
more general results, and discussion of "relative fundamental lemmas", see [Ja2], 
[JaYe], [Mao], and [JLR]. 

Step II. Prove (again for these matching / and /') that the relative traces of the 
continuous parts of the kernels Kf and Kf> cancel each other out. 

To explain what this means, let us first write out the spectral expansion of Kf 
in crude form: 

K f (x, y) = K s , cont + K fiSp + K ffi , 

where Kf^ CODt denotes the kernel of R(f) restricted to the continuous spectrum of 
L 2 (H(F)\H(A) 7 u; E f F ), and Kf^ sp the kernel for the discrete non-cuspidal spec- 
trum. Similarly, 

Kf(gi,g2) = Kf', cont + Kf,s P + Kffi- 

A simple but key observation is that the sp-part of the kernel contributes nothing 
to the relative trace. Indeed, 



BC : (g,cr) 



(f v y(t) = (f v y(Bc(t)) 




since these parts of the spectrum are non-generic. 
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Thus, if we can show that the continuous kernels contribute roughly equal relative 
traces, we might then conclude 

(2.2) J J Kffl(m, n 2 )ipN(ni 1 n 2 ) dn\ dn 2 = jj Kf fi (n ,h)ip N <(n) dn dh 

Henceforth, we denote the maximal compact of H v by K v and the maximal 
compact of G v by K' v . 

Step III. Use the equality of the relative cuspidal traces on H and G to establish 
the asserted base change bijection between the cuspidal (stable) 7r on H and the 
cuspidal (distinguished) II on G. 

To see why this is relatively (!) straightforward, let us write out the explicit 
spectral expansions for Kf$ and Kfi^, namely, 

K ffi (x,y)= J2 E <f)^)W), 

cuspidal 

and 

K f ,, Q (g u g 2 ) = J2 E W)<l>{9iWW)- 
n 0'eVn 

cuspidal 

If we assume / and /' are right K s = Y\ v as K v (resp. K' S ) invariant, where S is 
a finite set of places (including the infinite ones), then these identities read 

(2.3) K ffi (x,y)= ]T (/ S ) v (t(/))EI1^(/")^ 
and 

(2.3') K f , fi (g u g 2 ) = £ (/' 5 ) V WII 5 )) £ [] IL^)^)^)- 

n K ' s V{o} {4>' u } veS 

If we further assume that for v ^ S (where both f' v and f v are spherical), /„ 
corresponds to f' v via the natural base change map between their Hecke 
algebras, then, 

(/TW^)) = (/' 5 ) V (BC(t(rr s ))). 

So plugging these expressions into the relative trace formula identity (2.2) yields 
the more explicit identity 

(2.4) ^A(^,/ s )(/' 5 ) v (BCW7r s )) = ^i?(n,^)(.f' S ) v Wn s )) 

n n 

with 



B(nj' s )=^wl {f , w (e)DW), 



IK} 

W$(e) = / <fi(n)ipN(n) dn, (with a similar expression for ) 

JN(F)\N(A) 
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and 



D(<f/) = [ 4>'{h)dh. 

Jz(A)H(F)\H(A) 



I Z{k)H(F)\H 

Note that (2.4) may be viewed as a (absolutely convergent) linear combination 
of irreducible characters of the group (G) s — Yl v( ^ s G v . Also, note that a particular 
cuspidal Y[ on G(A) will appear on the right side of (2.4) only if it is distinguished, 
i.e., D((f>') 7^ for some (f>' . Thus it is not surprising that a "linear independence of 
characters argument" yields the desired bijection of Theorem 1.1. For more details, 
we refer the reader to [Ye] , and §4 below. 

Since the most subtle part of the proof outlined above is Step II, we focus on 
this step in some detail in the sections below. 



3. Step II: Subtracting off the Continuous Spectrum. Once we know that 
the relative trace formulas on H and G agree for matching / and /', it follows 
(keeping in mind the previous discussion) that 

(3.1) jj Kf i0 (m, n 2 )^N{n^ 1 n 2 ) dn\ dn 2 - J J Kf> i0 (n' ,h)ipN'(n') dn' dh 

= J J Kf,cont(ni,n 2 )^N(n^ 1 n 2 ) dni dn 2 - J J Kf> tCont (n' , h)ip N i(n') dn' dh. 

"Subtracting off the continuous spectrum" from the initial relative trace formula 
identity thus amounts to being able to prove that both sides of this last identity 
are identically zero. 

According to (2.3) and (2.4), the left hand side of (3.1) reduces to the discrete 
expression 

£ A(tt, / s )(/' S ) v (BC(t(^)) - ]T B(H, f' s )(f' s nt(n s )). 
7T n 

So the natural strategy should be to show that the right hand side of (3.1) can be 
written as the difference of two absolutely convergent expressions of the form 

(3.2) J $(a)(f' S y(a)da 

with $(cr) an intcgrable function on the unramified unitary dual of G s . In fact it 
turns out that the right-hand side of (3.1) contributes a discrete as well as con- 
tinuous distribution (i.e., the continuous spectrums cancel each other only modulo 
an "unstable" piece of the cuspidal spectrum). Nevertheless, it is possible (after 
rearranging terms) to apply the trick of Langlands from [La2, p. 211]: since (3.1) 
amounts to an equality between an "atomic" and a "continuous" measure on the 
unramified unitary dual of G s , both sides must be identically zero. 
Consider for example the expression 

(3.3) // K ftCont (n 1 ,n 2 )tp N (n^ 1 n 2 )dn 1 dn 2 , 

J J[N(F)\N(A)] 2 
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where 



Kf,c<mt(x,y) = K x( x >v) 

X={(M,n)} 



/OO 
E(x, p{p, it){ f)4>, it, (J,)E(y, <f>, it, p) dt. 
-oo 



Because the integration is over a compact domain, it is easy to compute that (3.3) 
equals 

/oo 
(f s ) v (p,it)c(f s ,p,it)dt. 
-oo 



ft 

unramificd 
outside S 



where (p,it) denotes the conjugacy class in L G S attached to Ind^ s || rf , 
c(f s ,p,it) = ^ W(p(Ms,i*)(/s)0,^, fj,,it)W((t>,ip, p,it), 



and 



W(cfi,ip, fJ-,it) = / E(n,4>,it, fj,)ipN(n) dn. 

J N(F)\N(A) 



' N(F)\N(A) 

Moreover, because f s "matches" f' S , we have 

(3.5) (f s y(n,it) = f s y(BC(n,it), 

with (BC)(p, it) = (p',it) and p'(z) = p(zz). Thus the contribution from Kf_ cont 
to the right hand side of (3.1) is indeed of the form (3.2). 
Now consider the relative trace formula expression 

Kf (n' , h)tpN' (n') dn' dh 

Z(A)H(F)\H(A) J N'(F)\N'(A) 

corresponding to the quadratic extension. As we already hinted (and shall explain 
in the next section) , this turns out to produce discrete as well as continuous sums 
of traces (f ,S ) v (cr). 



4. Step II Continued: Periods of Truncated Eisenstein Series, and the 
Continuous relative trace over G. Our purpose here is to describe some of the 
difficulties involved in proving the following result: 

Theorem 4.1. The contribution of the continuous spectrum to the relative trace 
formula for G is 

E [°° (f s nn',it)c 1 (f s ,n',it)dt+ ]T (f s y(u',o)c 2 (f s ,p>,o). 
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Here both expressions are absolutely convergent, 

/oo 
\ci{f s ,lj! ,it)\dt < oo. 
fj,' p -°° 

As we already explained in the last section, this theorem is exactly what is 
needed to complete the proof of Theorem 1.1. Indeed, if we subtract the integral 
expression here from 

JJ Kf^ OD t{n l7 n2)tpN(n^ 1 n 2 )dn 1 dn 2 
in (3.1) we obtain (using(3.4) and (3.5)) the expression 

(4.2) ]T [°° {f' S M,it)c(f' s ,n',it)dt. 

On the other hand, bringing the discrete sum in (4.1) over to the left-hand side 
of (3.1) yields the expression 

(4.3) J2 A(?r,fs)(f' S )(BC(t(ir s )) 

7T 

cuspidal 

J2 B(nj> s )(f s y m s )) 

II cuspidal 

- E (/' s W,o)c 2 (/U',o). 

Hence we indeed can conclude (by the arbitrariness of (f' S )) that both (4.2) and 
(4.3) are identically zero. 

Remark. For any given grossencharacter // of E which does not factor through the 
norm map, let 7r(/^') denote the cuspidal representation of H(A) whose Langlands 
parameter t 7Tv almost everywhere base change lifts to the Langlands parameter of 
Indg, n' v . These ?r(//) are the cusp forms constructed by Hecke and Maass (using 
either the theory of theta- functions or L- functions) . The fact that (4.3) equals zero 
gives (by "linear independence of characters" ) an alternate proof of the existence 
of such forms, as well as establishing the bijection asserted in Theorem 1.1. 

Now let us return to Theorem 4.1, giving a concrete expression for the relative 
trace of R CO nt{f')- As we have already noted — for a(T) sufficiently large, 



/ / K f , o Jn',h)Tp N ,(n')dn'dh 

J Z(A)H(F)\H(A) Jn'(F)\N'(A) 



I(F)\H(A) JN'(F)\N'> 

r i 

h-2 Kf>, cont(n' , h)^' N {n) dn 1 dh. 



Moreover, it follows from Propositions 2.3 and 2.5 of [Jal] that this last expression 
equals 

f f A l K x . h ) *I>n> (n' ) dn' dh 



*={(M>')} 
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and, ultimately, that 

(4.4) / f K f , o Jn',h)ip N ,(n')dn'dh 

JZ(A)H(F)\H(A) JN'(F)\N'(A) 

=e r e ( / ate ^ **) w. «) 

.., J-oo tll , Z(A)H(F)\H(A) ' 



x W(<j/,ip,n',it){e)dt. 

Thus our task is to obtain an explicit expression for the right-hand side of this last 
equation, as T — > oo (i.e., for T sufficiently large ... ); in this we follow [Jal]. 

Remark. Introducing a truncation operation facilitates the explicit calculation 
of the relative continuous spectrum of G but, as with the ordinary trace formula, 
also produces "discrete traces" from within the continuous spectrum, as we shall 
now explain. 

In (4.4), the sum is over the (classes of) unitary characters // of E*\A% inducing 
the Eisenstein series 

E{g,<t>',fj-',s)= <t>n{ia), 

y€B'(F)\G(F) 

with <^ e Ind //|| s , and B' — Resf B = M'N'. In computing the periods 
P((f/, n', it) = [ A T E(h, pfr, it)(f)4>', M ', it) dh, 

J Z(A)H(F)\H(A) 

one appeals to the Bruhat-type decomposition 

(*) G(F) = B'(F)H(F) (J B'(F) V H(F), 

where r\ in G(F) satisfies r/f]^ 1 = w. (One can take r\ = ^ ^ if y^r generates E.) 

The proposition below (see [JLai] and [Jal]) is to be viewed as a "relative analogue" 
of the Langlands formula for the inner product of two truncated Eisenstein series. 

Proposition 4.5. For each unitary character // of E*\A* E , set 

m(F*\A F ) if n' \ f ^ f = 1; 

otherwise, 

and 

W) = { m(A ^\ A ^ lf ^ a) = ^ 

\ otherwise, 
Then for sufficiently large a(T) , 

V (<£>', it) 

ff) it(Ti-T 2 ) f e -it(T x -T^) r 

= W)\ ; / </J(k)dk ; / M(w,it)<p'(k)dk\ 

I it J K it J K ) 

+ 5 2 (p ) [ <f>'(rih)dh 

JTti(A)\H(Af) 

with 

T n = r]~ 1 B'(F)r) Q H(F). 



hit* ) - 
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Remark. The <5i-term here corresponds to the contribution from the "small-cell" 
part of (*), whereas the ($2-term corresponds to the "big cell" (w ^ e). Note that 
the second term is independent of the truncation parameter T. Also, all terms 
must be understood in the sense of meromorphic continuation from s in some right 
half-plane to s — it. 

Sketch of the Proof. From the definition of A T , and the formula for the constant 
term of E, one computes that 

(4.6) A T E(g^',p',s) = ]T cj>'( 7 g)(l - r B {H{ 19 ) - T) 

7 £S'(F)\G(F) 

-M( w , s )0'( 7ff )T B (tf)(7.g)-T). 

(This is a special case of Arthur's "second formula" for A T E; see Lemma 4.1 of 
[A3].) The next step is to check that (for s in the domain of convergence of the 
Eisenstein series), the series 

E 1^(^)1 I 1 - r B (H(l9) -T)\ + \M(w, s)<j>> ' {ig)\r B {H{ lg ) - T) 

7£B'(F)\G(F) 

is integrable over Z(A)H(F)\H(A). This justifies the term by term integration of 
(4.6) over Z(A)H(F)\H(A); sec [Jal] §3 for the case of GL(3). Then the decompo- 
sition (*), together with Iwasawa's decomposition for H(A), ultimately yields the 
Proposition. Implicit here is the fact that for a(T) sufficiently large, 

T B {H(r]h)-T) = for all h; 

this requires some proof (see Proposition 7.2 of [Jal] for a more general setting), 
and explains why the contribution from r] is indeed independent of T. 

In applying Proposition 4.5 to the calculation of 
(4.1) J J K f ,, cont (n', h)1> N , (n') dn' dh, 

let us first consider the contributions from characters y! with ^(m') 7^ 0- These are 
precisely the grossencharacters of E which factor thru the norm map from F. If in 
addition Si(/i') = 0, then Proposition 4.5 implies that the contribution of such p! 
to (4.1) is 



/OO _ 



)W(4>' ',Vy ,it)dt 



where 



6 f (v',it,<p') = 6 2 (n') / p{fj!,it){f)<l/{ v h)dh. 
Jt v (a f )\h(a f ) 



Note we can rewrite (4.7) as 

/OO 
{f s y{p',it)c f ,{p!,it)dt 
-OO 
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with 

C fs (//, it) dt = J2 W(4>', V, /A it)6 f > s (p', it, </>')• 

0' 

Now suppose ^2 (a*') = 0, i.e., // docs not factor thru the norm map from F. 
Then Proposition 4.5 implies that the contribution of each such p' to the relative 
trace (4.1) is 

(4.8) f^wtf^n'jty 

J <p> 

( it{T x -T 2 ) r it( Tl -T 2 ) r } 

W)\ / f<f>'(h)dh / M(w,it)f(j>'(k)dk\ dt, 

I it J K it J K J 

with 

/</>' = P {p' ,it){f)4>' . 

Notice the similarity here with the formula encountered for the ordinary trace 
formula term J^(f) m §4 of Lecture VII. There we computed the limit as T — > oo 
by applying Arthur's splitting formula for products of (G, M)-families. Here we 
can do the same thing, taking (as before) 

c B (it) = e»*(ri-T 2 ) and Cs{lt) = e -2u( Tl -T 2) ^ 

But now we set 

d B (it)=W(<j>',iP,p',it) f f4>'{k)dk 

JK 

and 

d B {it) = W{cf>',ip,p',it) j M(w,it){f<f>')(k)dk. 



<K 

It must of course be checked that this d(A) is also a (G, M)-family (not obvious!). 
The main task, though, is to compute 



lim lim y^(cd)M(it)B(et) dt, 



with B{t) a compactly supported Schwartz function as in Lecture VII. Computing 
as before, we find that 



d' G (it) = d B (o)(= d B (o)) = w{4>',i>,n',o) / p(/i',o)(/)</» , (fc)* 

JK 

with (j)' in Ind^t', and — ultimately — that the above limit equals 

<$> J 

Thus we find that the total contribution (to the relative trace formula) from char- 
acters pi of this form is 

£ if' s no,p')c(f s ,p') 
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with 



C(^/' s ,m') = EW^,0,m') f P {Q^' s ){fsW{k)dk. 



It remains to observe that when 62 (//) ^= and Si(n') 7^ 0, the contribution to 
the relative trace formula is again just (4.5). There are apparent contributions 



2ii(Ti-T 2 ) r -2it{T t -T 2 ) r 

■ / f<f>'(k)dk / M(w,it)<f)'(k)dk 



it 



from V(4>' > m'j but they integrate to zero over R by way of a Riemann-Lebesgue 
Lemma (just like the terms in cdM(t) not belonging to the leading term Q = G in 
T(M) at the end of the last Lecture). We spare the reader the details. 

Concluding Remarks. A host of natural problems present themselves in connec- 
tion with any general development of the relative trace formula, especially as far 
as applications are concerned. In addition to the obvious problem of developing 
general local "matching" theorems (and related "fundamental lemmas" ) , there are 
these problems from the "spectral side" : 

(1) Develop a general formula for the period of a truncated Eisenstein series, 
analogous to Langlands' formula for the inner product of truncated Eisen- 
stein series; 

(2) Apply a general theory of (G, M)-families and smoothing functions B(X) 
to the calculation of the contribution of the "continuous spectrum" to the 
relative trace formula, and 

(3) Prove in general that the resulting spectral expressions are sufficiently con- 
vergent to justify subtracting them off from the full relative trace formula. 
For the case of GL(2), this can done by direct estimates on the intertwining 
operators, but already for GL(3) such a direct approach is problematic. 
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Lecture IX. Some Applications of 
Paley-Wiener, and Concluding Remarks 



In Lecture VII, we explained how (G, M)-families arose in Arthur's explicit calcu- 
lation of the general spectral terms J x (f)- We also alluded to why a Paley- Wiener 
theorem was needed to complete this calculation, and promised to return to it. In 
fact, this Paley-Wiener theorem is also crucial in handling other convergence prob- 
lems related to the trace formula. So our purpose now is to finally touch on these 
matters, albeit briefly. This being the last lecture, we shall also conclude with a 
few remarks orienting the reader to some papers of Arthur's that we so far haven't 
even mentioned. 

1. The Paley-Wiener Theorem. The Fourier transform of a compactly sup- 
ported smooth function / on 1 extends to an entire function F(A) on C, and 
satisfies a well-known growth condition, namely that there exists a constant N 
(depending on the support of /) such that 



for any integer n. According to the classical Paley-Wiener Theorem, these prop- 
erties characterize the image of the Fourier transform on the space of such test 
functions /. 

For a real reductive group G, let (G, K) denote the "Hecke algebra" of in- 
finite compactly supported smooth functions. Then for any / in C^°(G,K) and 
irreducible admissible representation ir of G (on a Banach space U^), the Fourier 
transform of / at n is defined to be the operator 



has as its domain the set of irreducible (admissible) representations of G, and for 
any (ir,U n ) takes values in the space of operators on U n . The purpose of the 
(operator) Paley-Wiener Theorem of [A10] is to characterize which functions 



sup{|F(A)| 



e- !V l Ba ^l(l + |Im(A)|) n } <oo 




The resulting function 



7T -> 7T(/) 



7r i ► F(ir) 



are of the form tt 7t(/) for some /. 
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More precisely, let us (following Arthur) fix a minimal parabolic subgroup B of 
G with Langlands decomposition N A Mq, and consider the nonunitary principal 
series of induced representations /9b(ct, A), indexed by quasi-characters A of A and 
irreducible representations a of M 1 . By a well-known theorem of Harish-Chandra, 
any 7r is equivalent to a subquotient of some Pb(&, A). This means that 7r(/) will 
be completely determined by the Fourier transform 

/: (a,A)^/ s (a,A)=p s (a,A)(/). 

What should the image of this Fourier transform be? 
Arthur defines PW(G, K) to be the space of functions 

F: (er, A) >—> F b (<t, A), 

which are entire, if -finite (in a sense I shall not explain here), and satisfy the growth 
condition 

(1.1) sup{||F(a,A)|| e - Ar ll Rc ( A )ll(l + ||Im(A)||)"} <oo, 

er.A 

for some N (and all n). Moreover, there is another, more complicated condition for 
F(a, A) which comes from the various intertwining maps between principal series. 
Namely, if there is a relation 

m 

^D k (p B (a k ,A k )(g)u k} v k ) = 
k=i 

valid for all g in G, where D k is a differential operator (acting through the variable 
A, and u k , v k are vectors in the space of p(a k} A k ), then F(ct, A) must also satisfy 
the relation 

m 

(1.2) ^D k (F((j k ,k k )u k ,v k ) = 0. 

k=l 

(Such relations exist commonly, but are difficult to characterize explicitly; indeed, 
a listing of all such relations would be tantamount to a complete knowledge of the 
irreducible subquoticnts of the principal series.) 
The main result of [A10] is that 

/-/>,A) 

provides an algebra isomorphism from C^°(G 7 K) onto PW(G,A'). The conse- 
quence of this result which is actually needed for the trace formula concerns the 
notion of Paley Wiener multipliers. 

By a "multiplier" of C^°(G, K) is meant a linear operator C on C^°(G, K) such 
that 

C(f*g)=C(f)*g = f*C(g) 



LECTURE IX. APPLICATIONS OF PALEY-WIENER; CONCLUDING REMARKS 87 



for all /, g in C^°(G,K) . (The algebra of all such "multipliers" coincides with 
Endjx( G ) (C£°(G, K)). The Paley Wiener multipliers are constructed as follows. 
Set 

(1.3) f) = ii) K © f)o, 

where \)k is a fixed Cartan subalgebra of the Lie algebra of K n Mo, and t)o is 
the Lie algebra of the split component of Mq. (Here Mo is the Levi component 
of a fixed minimal parabolic subgroup of G.) If © is the Lie algebra of G, then 
f)c is a Cartan subalgebra of and f) C f)c is invariant under the Weyl group 
W of (C5c, t)c)- Let £(i)) w be the algebra of compactly supported distributions on 
f) which are invariant under W. Then for any 7 in £(t)) w , the Fourier-Laplace 
transform 

7("). "€f)£ 

is an entire function on f)£ which is VF-invariant and satisfies a growth estimate of 
the form 

(1.4) sup(\7(is)\e- N ^ Rc(, ^(l + \\lmv\\- n -*) <M 

for some integers 7V 7 and n 1 . According to the result below, these 7 provide us 
with multipliers C 7 (called Paley— Wiener multipliers). 

Theorem 1.5 (see Thm. 4.2 of [A10]). For each distribution 7 in £(t)) w , and f 

in C^°(G, K), there is a unique / 7 in C^°(G,K) such that 

(1-5) tt(/ 7 ) = 7KM/) 

/or any irreducible admissible representation tt. (Here is the W -orbit in 

associated to the infinitesimal character of ir.) In particular, if we define 

C-i(f) — fn 

then the map 



7 -> C7 7 , 7 e f (f)) 



is a homomorphism from the algebra £(t)) w to the algebra of multipliers 

End u{G) (C™(G,K)). 



Remarks, (i) This Theorem is almost an immediate consequence of the Paley- 
Wiener Theorem described earlier. Indeed, the uniqueness is clear (since (1.5) 
completely determines the Fourier transform of / 7 ). On the other hand, the exis- 
tence of fry results from checking that the function 

F B (a,A) =7(i/ CT +A)/ B (o-,A) 

belongs to PW(G, K), and hence is the Fourier transform of some / 7 ; then since 
7r(/ 7 ) is given by the action of p(o, A)(/ 7 ) on the appropriate subquotient, it follows 
that 

7r(/ 7 )=7K+A)7r(/)= 7 KK(/), 
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as required. 

(ii) That C 7 (f) = / 7 defines a "multiplier" of C™{G,K) follows formally from 
(1.5): C 7 (/ * g) = C 7 (/) * g, since 

4(/*.9) 7 ) = tKM/ * 9) 
= 7KM/M5) 
= 7r(/ 7 )7r(ff). 

(iii) This Paley- Wiener theorem may be viewed as an analogue for real groups of 
the p-adic results of [BDK] and [Ro3] . 

2. Applications of Paley— Wiener to the Calculation of J x (/).. Recall how 
J%(f) wa s defined (for sufficiently large T) to be the integral over the diagonal of the 
truncated kernel function (K x (x, y). Using the explicit formula for Aj (K x (x, y) 
in terms of Eisenstein series, we then wrote 

(2-1) J J(/)=EE/ C(A,/)rfA, 

p a • / »p/ ia G 

where 

<(A,/) = -±- 1 tr{nl <r {P,\)p{v,\){f)), 
n[Ap) 

and Q xo .(P, A) is the operator on the space of p(a, A) defined by 

(fi£„(P,A)0)= / A T E(x,<l>',X)A T E(x, < t>,X)dx. 

JZ(A)G(F )\G(A) 

We stress that the integral formula (2.1) for J x {f) is absolutely convergent, and 
that J%(f) is known to be polynomial in T. The problem is to explicitly compute 
Jj(/) for a conveniently chosen value To of T. 

At the end of Lecture IV, we used Arthur's general theory to derive an explicit 
formula for Jj(/) in the case of G = GL(2). The strategy consisted of three steps: 

(1) plug into (2.1) Langlands' explicit formula for 0£ iO .(P, A) in terms of inter- 
twining operators; 

(2) use the theory of (G, M)-families to rewrite ^^(A, /) in a form still more 
amenable to integration over dX; and 

(3) compute the resulting integrals over A (and show that some of the contri- 
butions are negligible as T — > 00). 

It was in this last step that we resorted to a Riemann-Lebesgue Lemma whose 
application is best justified by way of Paley- Wiener multipliers; indeed these mul- 
tipliers allow for the insertion of a compactly supported function B(X) into the 
formula for J X T, as explained in Theorem 2.3 below. 

For general G, there is an additional, more fundamental reason for requiring 
the Paley- Wiener Theorem in the calculation of J x (/). It derives from the fact 
that Langlands' explicit formula for f^ CT (P, A) is not valid for Eisenstein series 
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induced from non-cuspidal a. (Equivalcntly, it is not valid when P lies outside P x , 
the set of parabolics attached to the collection of pairs (Mb,tb) describing the 
cuspidal datum In these cases, Arthur proved "only" that the "nice" formula 
for fi£ (P, A) is an asymptotic one for large T, valid uniformly only on compact 
subsets of A. 

More precisely, let w^ a {P, A) denote the operator on p(a, A) defined explicitly in 
terms of intertwining operators as the value at A' = A of 

(t'X'-t\)(T) 



(cf. formula (2.1) of Lecture VII for complete details). Then (cf. Corollary 9.2 of 
[A9]) the difference between (P, A)p(A, a) (/)</>, <j>) and (the "nicer" expression) 
(w£ (P, X)p{\,o)(f)(f>,4>) is bounded in absolute value by 

(2.2) r(A)|M| 2 e- e " r ", 

where e > 0, and r(A) is a locally bounded function on ia* P . Thus the integrals of 
these expressions over all of ia* P /ia G need not approach one another as T —* oo. 
In short, the substitution of for f^ CT in (2.1) is simply noi justified. 

Example. For G = GL(2), we explicitly derived Arthur's estimate(2.2) in case 
X = {(M,p)}, with p = (U -1 , and P = G. Namely, in the proof of Prop. 2.7 in 
Lecture IV, we computed the difference of these expressions to be Me~ 2t ^ Tl ~ T2 \ Of 
course, in this case the problem of uniformity alluded to above is a priori absent 
(the variable A being constrained to lie in the space ia* P /ia* G = {0} . . . ). 

In general, the way out of this problem, like the problem of showing certain terms 
to be "negligible" in step (3) above, is by way of Arthur's Paley- Wiener Theorem, 
as we shall now (finally) explain. 

For any B in S(it)* /ia G ) w , and irreducible unitary representation a on M(A), 
define 

Pv(A) = B(iY a + A), 

Here Y a is the imaginary part of the orbit {v a } in f)£ associated to the infinitesimal 
character of er^ (and we have fixed an embedding of any a* P in f)*). Recall that 
{v a + A} is also the orbit in f)£ associated to the infinitesimal character of the 
induced representation p(a oc , A) of Goo . 

Theorem 2.3 (see Thm. 6.3 of [A5]). Suppose B in S(it)* /ia* G ) w is such that 
B(0) = 1. For any e > 0, write B f for the function B € (v) = B{ev). Then 

Jl{f) = limlim I *-( A ' WW dX - 

Remarks, (i) The importance of this result is that it allows us to invoke the as- 
ymptotic formula for ^ CT . Indeed, if we take B in the Theorem to be compactly 
supported, then we can indeed substitute w^ CT for fi£ and write 

(2.4) J X T (/) = lim lim £ / tr(u£ CT (P, A)p(A, <r){f))B%{\) dX 

°° p )<7 Ji**p/i** G 
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(since the error term (2.2), multiplied by such a compactly supported B, will indeed 
approach uniformly in A). 

(ii) For large T, the expression 

E/ C(A,/)S CT (A)rfA 

is asymptotic to a polynomial P T {B) in T. So Theorem 2.3 really says 



or 



4(f) = limP' (5 £ ), 
jj(/) = lim lim J2 I *li\f)Bl{\)d\ 

P,cr 17 

where the limit in T (as in the statement of the Theorem) is interpreted as the 
polynomial P T (B e ) which is asymptotic to the given function as T approaches oo. 
On the other hand, since 

limBJ(A) = lim B(e(iY a + A)) = B(0) = 1, 
the dominated convergence theorem implies 

J x (/) = E / ( A ) rfA ( for T lar s e ) 



P,(7 ■ 



= lim ^ / *£(A)B«(A) dX (T large) 
= T li ^ o ] iin E / *.(A)B^(A)dA 



(with limr^oo interpreted as above). Thus all of the work of [A5] essentially boils 
down to a justification of the interchange of order of two limits (over e and T)\ 

(iii) (Concerning the Proof of Theorem 2.3). The basic idea of the proof is 
explained clearly on pp. 1258-60 of [A5]. The starting point is an application of 
the Paley- Wiener multiplier Theorem 1.5 to the archimedean component of any 
/^-finite function / in C^°(G(A)). For any 7 in £(t)) w , this yields a function / 7 in 
C~(G(A)) such that 

*J(A,/a)=7K + A)^(A,/). 
Using this relation, we write 

J^ih) = E / + A )^ A ' /) dX ( for T lar § e ) 

(2.5) = E / <( A ' /) ( / l(H)^ +X)(H) dH) dX 

P,a Jia p/ ia h Jt > 

= [(j2^(H)e^ H ^(H)dH 
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where 

In particular, if we fix H in f), and set 

sew 

with 7s-i# the Dirac measure at s~ 1 H, then (2.5) implies 

(2.6) jj(v) = \wr E e^-^k^" 1 ") 

sew Pm 

Note here that, for each value of H, the right hand side of (2.6) is a polynomial in 
T (since the left hand side is) . Thus we have constructed a family of polynomials in 
T, call them p T (H), whose value at H = is exactly J%(f) (since then / 7 « = /). 

Now to compute p T (H) at 0, the natural thing to do is to integrate this function 
against an arbitrary Schwartz function /3(H). By the Plancherel Theorem for f), 
the resulting inner product / p T '(H) (3(H) dH can be replaced by one over iif /ia*Q 
involving the Fourier transform 




W dH. 



In particular, f£(f) should be obtained by having (3 approximate the Dirac measure 
at 0, i.e., by using 

(3 t (H) =e- dim( V(3(e- 1 H) 
with e small, and / (3(H) dH = 1; Equivalently, this brings into play 

B e (v) = B(ev) 

with B(0) — 1, and this is roughly how one ends up with the sought-after formula 
J x T (/) = hm V / ^(XJ)B^(X)dX. 

e ™> P a Jia* p /ia* G 

The only problem is that p T (H) is not actually tempered, and hence cannot be 
integrated against any (3\ Thus a lengthy detour is required in Sections 4-6 of [A5], 
involving additional polynomials and analysis (but nothing approaching either the 
Paley- Wiener theorem, or the theorem on the polynomial nature of Jj(/))- 



3. A Final Formula for J2 x -^x (f)' ^ ^ e en d of Lecture VII, we explained 
how, for GL(2), the formula 

Jl(f) = lim Hm f tr(a£(i)p(/i, it)(f)B(et) dt 
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could be computed explicitly using (G,M) families. Now, thanks to formula 2.4, 
we have a similar strategy for computing Jj(/) in general, and the end result is a 
formula directly generalizing the explicit formula 

/CO f-OO 
tr(p(n,it)(f))dt+ / tv{M(-it)M'(it)p{n,it){f)dt 
-CO J — CO 

+ itr(M(0)p( M ,0)(/))+M/)T(G). 

(Recall that the "limit in T" is to be interpreted as the polynomial asymptotic to 
the given integral as T approaches oo.) 

N.B. In Lecture IV, we were interested only in the constant term of the polynomial 
i.e., J%(f) = Jx(f)- I n Arthur's general theory, one fixes on a convenient 
point T (which in many "classical" situations is just zero; see Lemma 1.1 of [A4]), 
and sets 

J x (/) = J?(/). 

Theorem 3.2 (sec Theorem 8.2 of [A6]). Suppose f e C~(Z(A)\G(A)). Then 
jj(/) equals the sum over M G £(M ), L <E C(M), a on M(A) (compatible with 
X), and w in W L (<XM)rcg of the product of 

with 

f \P{M)\- X Y, te(M L (P,\)M(P,w)p(a,\)(f))d\. 

Jia* L /ia G PeV(M) 

Remarks, (i) The exact definition of each of the objects appearing here is to be 
found in Sections 4 through 8 of [A6]. But the point is that they are all more 
or less familiar objects related to the intertwining operators Mq\ p discussed in 
Lecture VII. For GL2, there will be only three terms in the sum above: one for 
M = L = Mo, with w = e and a = p a character of M (this gives the term in (3.1) 
with the logarithmic derivative); one for M = Mq,L = G, w the non-trivial Weyl 
element, and a a character of Mq (this gives the term involving tr(M(0)p(/i, 0)(/))); 
and one with M = L = G, w = e, and p the character /z(det #) of G(A) (this gives 
the contribution ^i(/)r(G)). 

(ii) The proof of Theorem 3.2 consists of two non-trivial steps. First one uses 
(G, M) families to carry out the explicit computation of 

lim" T lmi "EE / < x (A,/)S.(eA)dA) , 

where wJ x (A, /) is a sum of special values of functions ejw(A) for particular (G, M)- 
familics e(A) = c(A)d(A) (involving the intertwining operators Mq\ p )\ this is car- 
ried out in Sections 3 through 5 of [A6] , and the end result is a sum of terms of the 
form 

lira/ E te(M L (P,X)M(P,w)p(\,a)B e „(X)d 

e " >0 Jial/ia* G P£ -p( M ) 
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with B in (if)* / 'ia* G ) w equal to 1 at 0. Clearly the second step must be to show 
that this (limit and) test function can be made to disappear! 

Equivalcntly, by the dominated convergence theorem, one must show that the 
sum 

E / \\M L (P,X)p(X,a)(f)\\dX 

is finite. This is carried out in Sections 6 through 8 of [A6], using normalized 
intertwining operators, and estimates on the scalar functions defining these nor- 
malizations. (Some assumptions are made on these normalizations at p-adic places, 
but at least for GL(n) these assumptions are now proved; see [Shal] and [Sha2]). 

Finally, having reached the exalted plateau which Theorem 3.2 represents, we 
should mention that it is really just the beginning^. In particular it is not yet known 
if the explicit spectral expansion 

(3.3) e J x(f) = E E E / E tT ( M *L( p > a, *, u>, f) d\ 

x x m,l a ■ / *e*»i,/*»G p 

w 

converges absolutely as a multiple integral. (The problem is that the estimates on 
the normalizing factors alluded to above are not uniform in %!). 
In other words, we know that 

Ei J x(/)i<°°> 

X 

and that 

E [\\M L (P,X)p(X,o-)(f)\\dX <oc, 

aeU(M(A)) x J 

but not that this last sum can be summed over all \- This lack of knowledge 
causes problems in applying the trace formula, say to base change, where on needs 
absolutely convergent expressions (in order to use tricks like Langlands' to subtract 
off the non-discrete spectrum). In order to get around this problem, Arthur proves 
certain estimates on the convergence of 

Ei J x(/)i 

X 

which may be viewed as a weak form of the conjectured absolute convergence of 
the multiple integrals (3.3). This involves the infinitesimal characters of the infinite 
components of the representations a appearing in the spectral expansion, and an 
application of the Paley- Wiener multiplier theorem. The end result is a technique 
called "separation of infinitesimal characters via multipliers" which allows one to 
obtain the required equality of discrete traces from the matching of the full traces; 
details appear in [A12], [AC], and [La3]. 
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4. Related Works. Here are some of the topics from Arthur's work which we 
have ignored till now. 

In comparing trace formulas for different groups (for example, in proving base 
change for GL„), one wants to rewrite the basic formula 

(4.i) J2 J °(f) = J2 J x(f) 



in invariant form, i.e., as an identity 
(4-2) £'o(/) = 



where each of the terms is an invariant distribution. The reason for this is that one 
needs to assert the equality of two trace formulas for "associated" (or "matching" ) 
functions, given only by their orbital integrals (which arc invariant distributions 
... ) . Such an invariant trace formula is obtained already in [A4] , where a subtle 
rearrangement and modification of the terms J {f) and J x (f) yields the new col- 
lection of invariant distribution I (f) and I x (f) appearing in (4.2). However, this 
is only the "coarse" form of the invariant trace formula, analogous to the "coarse" 
formula •/<>(/) = T^^xif)- To get the "fine" expansion of the invariant trace 
formula requires the papers [All] and [A12] 

Explicit formulas for I (f) are obtained from the analogous fine expansions of 
J {.f) discussed in [A13] and [A14]. (In Lecture IV we described the exact form 
°f Jo{f) — for unramified o — in terms of weighted orbital integrals; the extreme 
opposite case — of o corresponding to {1} — is the subject matter of [A13], and the 
general mixture of these extremes — is handled in [A14].) On the other 

hand, explicit formulas for I x (f) are obtained by another delicate mixture of the 
techniques of Paley- Wiener multipliers and (G, M)-families, somewhat analogously 
to the case of ^ J\(f)- 

Remarks, (i) In a very recent work, Labesse has explained how one can apply the 
trace formula to the base change problem for GL(n) without first putting the trace 
formula in invariant form. This gives an alternate, simpler approach to [AC]; see 
[La3]. 

(ii) Recall from our discussion of Jacquet's relative trace formula that no truncation 
operator was needed to define the "relative" distributions J (f) and J x (f); hence 
these terms are already invariant! 

(iii) Outside the scope of these "introductory" Lectures are the works of Kottwitz 
and others on the stable trace formula (see [Kot2] and [La5]), and the more recent 
papers of Arthur on unipotent representations, A-packets, etc. (see [A15], [A16]). 
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